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About This Manual 


Preface 


Read This First 


The TMS320C54x is a fixed-point digital signal processor (DSP) in the 
TMS320 family. This book, the first volume of a 5-volume set, serves as a refer- 
ence for the TMS320C54x DSP and provides information for developing hard- 
ware and software applications using the 54x. Unless otherwise specified, all 
references to the ’54x apply to the TMS320C54x, the TMS320LC54x, and the 
TMS320VC54x. 


Many device references are shown with an apostrophe (’ ) replacing the usual 
alphanumeric prefix (ex. 549 instead of TMS320C549). You should consult 
the device-specific datasheet if you need more information about the device’s 
nomenclature. 


This user’s guide contains limited information about the enhanced peripherals 
available on the ’5402, 5410, and ’5420 devices. For detailed information on 
the enhanced peripherals, see volume 5 of this reference set: TMS320C54x 
DSP, Enhanced Peripherals, literature number SPRU302. 


For a summary of updates in this book, see Appendix D, Summary of Updates 
in This Document. 


How to Use This Manual 


The table below summarizes the 54x information contained in this book. 
(For a complete listing, refer to the Table of Contents.) 


If you are looking for 
information about: Turn to these chapters: 
Addressing modes Chapter 5, Data Addressing 


Chapter 6, Program Memory Addressing 


Buffered serial port Chapter 9, Serial Ports 
Bus structure Chapter 2, Architectural Overview 
Clock generator Chapter 2, Architectural Overview 


Chapter 8, On-Chip Peripherals 


How to Use This Manual 


If you are looking for 
information about: 


CPU architecture 


External bus 

Hold mode 

Host port interface 
Interrupts 


Memory 


On-chip peripherals 
Overview of the ’54x 


Parallel I/O Ports 


Power-down modes 
Program control 

Pipeline latencies 

Reset 

ROM code submission to TI 
Serial ports 

Status registers 

TDM serial port 


Timer 


Wait-state generator 


Turn to these chapters: 


Chapter 2, Architectural Overview 


Chapter 4, Central Processing Unit 
Chapter 10, External Bus Operation 
Chapter 10, External Bus Operation 
Chapter 8, On-Chip Peripherals 

Chapter 6, Program Memory Addressing 


Chapter 2, Architectural Overview 


Chapter 3, Memory 
Chapter 8, On-Chip Peripherals 
Chapter 1, /ntroduction 


Chapter 2, Architectural Overview 


Chapter 8, On-Chip Peripherals 

Chapter 6, Program Memory Addressing 
Chapter 6, Program Memory Addressing 
Chapter 7, Pipeline 

Chapter 6, Program Memory Addressing 
Appendix C, Submitting ROM Codes to TI 
Chapter 9, Serial Ports 

Chapter 4, Central Processing Unit 
Chapter 9, Serial Ports 


Chapter 2, Architectural Overview 


Chapter 8, On-Chip Peripherals 


Chapter 2, Architectural Overview 


Chapter 8, On-Chip Peripherals 


Notational Conventions / Information About Cautions 


Notational Conventions 
This book uses the following conventions. 


(1 The TMS320C54x DSP can use either of two forms of the instruction set: 
amnemonic form or an algebraic form. This book uses the mnemonic form 
of the instruction set. For information about the mnemonic form of the 
instruction set, see TMS320C54x DSP Reference Set, Volume 2: Mne- 
monic Instruction Set. For information about the algebraic form of the 
instruction set, see TMS320C54x DSP Reference Set, Volume 3: Algebra- 
ic Instruction Set. 


Lj Program listings and program examples are shown ina special type- 
face. 


Here is a segment of a program listing: 


STL A, *AR1+ ; Int_RAM(I)=0 
RSBX INTM ;Globally enable interrupts 
B MAIN_PG ,;Return to foreground program 


(1 Square brackets, [ and ], identify an optional parameter. If you use an op- 
tional parameter, specify the information within the brackets; do not type 
the brackets themselves. 


Information About Cautions 


This book contains cautions. 


This is an example of a caution statement. 


A caution statement describes a situation that could potentially 


damage your software or equipment. 


The information in a caution is provided for your protection. Please read each 
caution carefully. 
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Related Documentation from Texas Instruments 


Related Documentation from Texas Instruments 


The following books describe the ’54x and related support tools. To obtain a 
copy of any of these Tl documents, call the Texas Instruments Literature 
Response Center at (800) 477-8924. When ordering, please identify the book 
by its title and literature number. Many of these documents are located on the 
internet at http:/Awww.ti.com. Click DSPS Solutions, then click DSP Literature. 


TMS320C54x DSP Reference Set is composed of five volumes that can be 
ordered as a set with literature number SPRU210. To order an individual 
book, use the document-specific literature number: 


TMS320C54x DSP Reference Set, Volume 1: CPU and Peripherals 
(literature number SPRU131) describes the TMS320C54x 16-bit 
fixed-point general-purpose digital signal processors. Covered are 
its architecture, internal register structure, data and program 
addressing, the instruction pipeline, and on-chip peripherals. Also 
includes development support information, parts lists, and design 
considerations for using the XDS510 emulator. 


TMS320C54x DSP Reference Set, Volume 2: Mnemonic Instruction 
Set (literature number SPRU172) describes the TMS320C54x 
digital signal processor mnemonic instructions individually. Also 
includes a summary of instruction set classes and cycles. 


TMS320C54x DSP Reference Set, Volume 3: Algebraic Instruction 
Set (literature number SPRU179) describes the TMS320C54x 
digital signal processor algebraic instructions individually. Also 
includes a summary of instruction set classes and cycles. 


TMS320C54x DSP Reference Set, Volume 4: Applications Guide 
(literature number SPRU173) describes software and hardware 
applications for the TMS320C54x digital signal processor. Also 
includes development support information, parts lists, and design 
considerations for using the XDS510 emulator. 


TMS320C54x DSP Reference Set, Volume 5: Enhanced Peripherals 
(literature number SPRU302) describes the enhanced peripherals 
available on the TMS320C54x digital signal processors. Includes 
the multichannel buffered serial ports (McBSPs), direct memory 
access (DMA) controller, the HPI-8 and HPI-16 host port inter- 
faces, and the interprocessor communication module. 


TMS320C54x, TMS320LC54x, TMS320VC54x Fixed-Point Digital Signal 
Processors (literature number SPRSO039) data sheet contains the 
electrical and timing specifications for these devices, as well as signal 
descriptions and pinouts for all of the available packages. 


vi 


Related Documentation from Texas Instruments 


TMS320C54x DSKplus User’s Guide (literature number SPRU191) 
describes the TMS320C54x digital signal processor starter kit (DSK), 
which allows you to execute custom ’C54x code in real time and debug it 
line by line. Covered are installation procedures, a description of the 
debugger and the assembler, customized applications, and initialization 
routines. 

TMS320C54x Assembly Language Tools User’s Guide (literature number 
SPRU102) describes the assembly language tools (assembler, linker, 
and other tools used to develop assembly language code), assembler di- 
rectives, macros, common object file format, and symbolic debugging di- 
rectives for the ’C54x generation of devices. 


TMS320C5xx C Source Debugger User’s Guide (literature number 
SPRUO99) tells you how to invoke the ’C54x emulator, evaluation 
module, and simulator versions of the C source debugger interface. This 
book discusses various aspects of the debugger interface, including 
window management, command entry, code execution, data 
management, and breakpoints. It also includes a tutorial that introduces 
basic debugger functionality. 


TMS320C54x Code Generation Tools Getting Siarted Guide (literature 
number SPRU147) describes how to install the TMS320C54x assembly 
language tools and the C compiler for the ’C54x devices. The installation 
for MS-DOS™, OS/2™, SunOS™, Solaris™, and HP-UX™ 9.0x systems 
is covered. 


TMS320C54x Evaluation Module Technical Reference (literature number 
SPRU135) describes the ’C54x evaluation module, its features, design 
details and external interfaces. 


TMS320C54x Optimizing C Compiler User’s Guide (literature number 
SPRU103) describes the ’C54x C compiler. This C compiler accepts 
ANSI standard C source code and produces TMS320 assembly lan- 
guage source code for the 'C54x generation of devices. 


TMS320C54x Simulator Getting Started (literature number SPRU137) de- 
scribes how to install the TMS320C54x simulator and the C source 
debugger for the ‘C54x. The installation for MS-DOS™, PC-DOS™, 
SunOS™, Solaris™, and HP-UX™ systems is covered. 


TMS320 Third-Party Support Reference Guide (literature number 
SPRU052) alphabetically lists over 100 third parties that provide various 
products that serve the family of TMS320 digital signal processors. A 
myriad of products and applications are offered—software and hardware 
development tools, speech recognition, image processing, noise can- 
cellation, modems, etc. 
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Related Documentation from Texas Instruments / Technical Articles 


Technical Articles 


viii 


TMS320C548/C549 Bootloader Technical Reference (literature number 
SPRU288) describes the process the bootloader uses to transfer user 
code from an external source to the program memory at power up. (Pres- 
ently available only on the internet.) 


TMS320 DSP Development Support Reference Guide (literature number 
SPRUO011) describes the TMS320 family of digital signal processors and 
the tools that support these devices. Included are code-generation tools 
(compilers, assemblers, linkers, etc.) and system integration and debug 
tools (simulators, emulators, evaluation modules, etc.). Also covered are 
available documentation, seminars, the university program, and factory 
repair and exchange. 


Awide variety of related documentation is available on digital signal processing. 
These references fall into one of the following application categories: 


General-Purpose DSP 
Graphics/Imagery 
Speech/Voice 

Control 

Multimedia 

Military 
Telecommunications 
Automotive 
Consumer 

Medical 

Development Support 


OUUOUUUUUOULU 


In the following list, references appear in alphabetical order according to 
author. The documents contain beneficial information regarding designs, op- 
erations, and applications for signal-processing systems; all of the documents 
provide additional references. Texas Instruments strongly suggests that you 
refer to these publications. 


General-Purpose DSP: 


1) Chassaing, R., Horning, D.W., “Digital Signal Processing with Fixed and 
Floating-Point Processors”, COED, USA, Volume 1, Number 1, pages 1-4, 
March 1991. 


2) Defatta, David J., Joseph G. Lucas, and William S. Hodgkiss, Digital Sig- 
nal Processing: A System Design Approach, New York: John Wiley, 1988. 


9) 


Technical Articles 


Erskine, C., and S. Magar, “Architecture and Applications of a Second- 
Generation Digital Signal Processor,” Proceedings of IEEE International 
Conference on Acoustics, Soeech, and Signal Processing, USA, 1985. 


Essig, D., C. Erskine, E. Caudel, and S. Magar, “A Second-Generation 
Digital Signal Processor,” /EEE Journal of Solid-State Circuits, USA, Vol- 
ume SC-21, Number 1, pages 86-91, February 1986. 


Frantz, G., K. Lin, J. Reimer, and J. Bradley, “The Texas Instruments 
TMS320C25 Digital Signal Microcomputer,” /EEE Microelectronics, USA, 
Volume 6, Number 6, pages 10-28, December 1986. 


Gass, W., R. Tarrant, T. Richard, B. Pawate, M. Gammel, P. Rajasekaran, 
R. Wiggins, and C. Covington, “Multiple Digital Signal Processor Environ- 
ment for Intelligent Signal Processing,” Proceedings of the IEEE, USA, 
Volume 75, Number 9, pages 1246-1259, September 1987. 


Jackson, Leland B., Digital Filters and Signal Processing, Hingham, MA: 
Kluwer Academic Publishers, 1986. 


Jones, D.L., and T.W. Parks, A Digital Signal Processing Laboratory Using 
the TMS32010, Englewood Cliffs, NJ: Prentice-Hall, Inc., 1987. 


Lim, Jae, and Alan V. Oppenheim, Advanced Topics in Signal Processing, 
Englewood Cliffs, NJ: Prentice- Hall, Inc., 1988. 


10) Lin, K., G. Frantz, and R. Simar, Jr., “The TMS320 Family of Digital Signal 


Processors,” Proceedings of the IEEE, USA, Volume 75, Number 9, pages 
1143-1159, September 1987. 


11) Lovrich, A., Reimer, J., “An Advanced Audio Signal Processor” , Digest of 


Technical Papers for 1991 International Conference on Consumer Elec- 
tronics, June 1991. 


12) Magar, S., D. Essig, E. Caudel, S. Marshall and R. Peters, “An NMOS Digi- 


tal Signal Processor with Multiprocessing Capability,” Digest of IEEE Inter- 
national Solid-State Circuits Conference, USA, February 1985. 


13) Oppenheim, Alan V., and R.W. Schafer, Digital Signal Processing, Engle- 


wood Cliffs, NJ: Prentice-Hall, Inc., 1975 and 1988. 


14) Papamichalis, P.E.,andC.S. Burrus, “Conversion of Digit-Reversed to Bit- 


Reversed Order in FFT Algorithms,” Proceedings of ICASSP 89, USA, 
pages 984-987, May 1989. 


15) Papamichalis, P., and R. Simar, Jr., “The TMS320C30 Floating-Point Digi- 


tal Signal Processor,” /EEE Micro Magazine, USA, pages 13-29, Decem- 
ber 1988. 
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Technical Articles 


16) Papamichalis, P.E., “FFT Implementation on the TMS320C30,” Proceea- 
ings of ICASSP 88, USA, Volume D, page 1399, April 1988. 


17) Parks, T.W., and C.S. Burrus, Digital Filter Design, New York, NY: John Wiley 
and Sons, Inc., 1987. 


18) Peterson, C., Zervakis, M., Shehadeh, N., “Adaptive Filter Design and 
Implementation Using the TMS320C25 Microprocessor” , Computers in 
Education Journal, USA, Volume 3, Number 3, pages 12-16, July-Sep- 
tember 1993. 


19) Prado, J., and R. Alcantara, “A Fast Square-Rooting Algorithm Using a 
Digital Signal Processor,” Proceedings of IEEE, USA, Volume 75, Number 
2, pages 262-264, February 1987. 


20) Rabiner, L.R. and B. Gold, Theory and Applications of Digital Signal Pro- 
cessing, Englewood Cliffs, NJ: Prentice-Hall, Inc., 1975. 


21) Simar, Jr., R., and A. Davis, “The Application of High-Level Languages to 
Single-Chip Digital Signal Processors,” Proceedings of ICASSP 88, USA, 
Volume D, page 1678, April 1988. 


22) Simar, Jr., R., T. Leigh, P. Koeppen, J. Leach, J. Potts, and D. Blalock, “A 
40 MFLOPS Digital Signal Processor: the First Supercomputer on a Chip,” 
Proceedings of ICASSP 87, USA, Catalog Number 87CH2396-0, Volume 1, 
pages 535-538, April 1987. 


23) Simar, Jr., R., and J. Reimer, “The TMS320C25: a 100 ns CMOS VLSI Digi- 
tal Signal Processor,” 1986 Workshop on Applications of Signal Processing 
to Audio and Acoustics, September 1986. 


24) Texas Instruments, Digital Signal Processing Applications with the TMS320 
Family, 1986; Englewood Cliffs, NJ: Prentice-Hall, Inc., 1987. 


25) Treichler, J.R., C.R. Johnson, Jr., and M.G. Larimore, A Practical Guide 
to Adaptive Filter Design, New York, NY: John Wiley and Sons, Inc., 1987. 


Graphics/Imagery: 


1) Reimer, J., and A. Lovrich, “Graphics with the TMS32020,” WESCON/85 
Conference Record, USA, 1985. 


Speech/Voice: 


1) DellaMorte, J.,and P. Papamichalis, “Full-Duplex Real-Time Implementa- 
tion of the FED-STD-1015 LPC-10e Standard V.52 on the TMS320C25,” 
Proceedings of SPEECH TECH 839, pages 218-221, May 1989. 


Technical Articles 


Gray, A.H., and J.D. Markel, Linear Prediction of Soeech, New York, NY: 
Springer-Verlag, 1976. 


Frantz, G.A., and K.S. Lin, “A Low-Cost Speech System Using the 
TMS320C17,” Proceedings of SPEECH TECH ’87, pages 25-29, April 
1987. 


Papamichalis, P., and D. Lively, “Implementation of the DOD Standard 
LPC-10/52E on the TMS320C25,” Proceedings of SPEECH TECH ’87, 
pages 201-204, April 1987. 


Papamichalis, Panos, Practical Approaches to Speech Coding, Engle- 
wood Cliffs, NJ: Prentice-Hall, Inc., 1987. 


Pawate, B.I., and G.R. Doddington, “Implementation of a Hidden Markov 
Model-Based Layered Grammar Recognizer,” Proceedings of ICASSP 
89, USA, pages 801-804, May 1989. 


Rabiner, L.R., and R.W. Schafer, Digital Processing of Speech Signals, 
Englewood Cliffs, NJ: Prentice-Hall, Inc., 1978. 


Reimer, J.B. and K.S. Lin, “TMS320 Digital Signal Processors in Speech 
Applications,” Proceedings of SPEECH TECH ’88, April 1988. 


Reimer, J.B., M.L. McMahan, and W.W. Anderson, “Speech Recognition 
for a Low-Cost System Using a DSP,” Digest of Technical Papers for 1987 
International Conference on Consumer Electronics, June 1987. 


Control: 


1) 


2) 


Ahmed, I., “16-Bit DSP Microcontroller Fits Motion Control System Applica- 
tion,” PC/M, October 1988. 


Ahmed, I., “Implementation of Self Tuning Regulators with TMS320 Family 
of Digital Signal Processors,” MOTORCON ‘88, pages 248-262, Septem- 
ber 1988. 


Allen, C. and P. Pillay, “TMS320 Design for Vector and Current Control of 
AC Motor Drives”, Electronics Letters, UK, Volume 28, Number 23, pages 
2188-2190, November 1992. 


Panahi, |. and R. Restle, “OSPs Redefine Motion Control”, Motion Control 
Magazine, December 1993. 


Lovrich, A., G. Troullinos, and R. Chirayil, “An All-Digital Automatic Gain 
Control,” Proceedings of ICASSP 88, USA, Volume D, page 1734, April 
1988. 
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Technical Articles 


xii 


6) Ahmed, |.,and S. Meshkat, “Using DSPs in Control,” Contro/ Engineering, 
February 1988. 


7) Meshkat, S., and I. Ahmed, “Using DSPs in AC Induction Motor Drives,” 
Control Engineering, February 1988. 


8) Matsui, N. and M. Shigyo, “Brushless DC Motor Control Without Position 
and Speed Sensors” , IEEE Transactions on Industry Applications, USA, 
Volume 28, Number 1, Part 1, pages 120-127, January-February 1992. 


9) Hanselman, H., “LQG-Control of a Highly Resonant Disc Drive Head Posi- 
tioning Actuator,” /EEE Transactions on Industrial Electronics, USA, Vol- 
ume 35, Number 1, pages 100-104, February 1988. 


10) Bose, B.K., and P.M. Szczesny, “A Microcomputer-Based Control and 
Simulation of an Advanced IPM Synchronous Machine Drive System for 
Electric Vehicle Propulsion,” Proceedings of IECON 87, Volume 1, pages 
454-463, November 1987. 


11) Ahmed, I., and S. Lindquist, “Digital Signal Processors: Simplifying High- 
Performance Control,” Machine Design, September 1987. 


Multimedia: 


1) Reimer, J., “DSP-Based Multimedia Solutions Lead Way Enhancing 
Audio Compression Performance” , Dr. Dobbs Journal, December 1993. 


2) Reimer, J., G. Benbassat, and W. Bonneau ur., “Application Processors: 
Making PC Multimedia Happen” , Silicon Valley PC Design Conference, 
July 1991. 


Military: 


1) Papamichalis, P., and J. Reimer, “Implementation of the Data Encryption 
Standard Using the TMS32010,” Digital Signal Processing Applications, 
1986. 


Telecommunications: 


1) Ahmed, I., and A. Lovrich, “Adaptive Line Enhancer Using the 
TMS320C25,” Conference Records of Northcon/86, USA, 14/3/1-10, 
September/October 1986. 


2) Casale, S., R. Russo, and G. Bellina, “Optimal Architectural Solution Us- 
ing DSP Processors for the Implementation of an ADPCM Transcoder,” 
Proceedings of GLOBECOM ’89, pages 1267-1273, November 1989. 


3) Cole, C., A. Haoui, and P. Winship, “A High-Performance Digital Voice 
Echo Canceller on a SINGLE TMS32020,” Proceedings of ICASSP 86, 


9) 


Technical Articles 


USA, Catalog Number 86CH2243-4, Volume 1, pages 429-432, April 
1986. 


Cole, C., A. Haoui, and P. Winship, “A High-Performance Digital Voice 
Echo Canceller on a Single TMS32020,” Proceedings of IEEE Internation- 
al Conference on Acoustics, Speech and Signal Processing, USA, 1986. 


Lovrich, A., and J. Reimer, “A Multi-Rate Transcoder,” Transactions on 
Consumer Electronics, USA, November 1989. 


Lovrich, A. and J. Reimer, “A Multi-Rate Transcoder” , Digest of Technical 
Papers for 1989 International Conference on Consumer Electronics, June 7-9, 
1989. 


Lu, H., D. Hedberg, and B. Fraenkel, “Implementation of High-Speed Voice- 
band Data Modems Using the TMS320C25,” Proceedings of ICASSP 87, 
USA, Catalog Number 87CH2396-0, Volume 4, pages 1915-1918, April 
1987. 


Mock, P., “Add DTMF Generation and Decoding to DSP- uP Designs,” 
Electronic Design, USA, Volume 30, Number 6, pages 205-213, March 
1985. 


Reimer, J., M. McMahan, and M. Arjmand, “ADPCM on a TMS320 DSP 
Chip,” Proceedings of SPEECH TECH 85, pages 246-249, April 1985. 


10) Troullinos, G., and J. Bradley, “Split-Band Modem Implementation Using the 


TMS32010 Digital Signal Processor,” Conference Records of Electro/86 and 
Mini/Micro Northeast, USA, 14/1/1-21, May 1986. 


Automotive: 


1) 


Lin, K., “Trends of Digital Signal Processing in Automotive,” /nternational 
Congress on Transportation Electronic (CONVERGENCE 88), October 
1988. 


Consumer: 


1) 


2) 


Frantz, G.A., J.B. Reimer, and R.A. Wotiz, “Julie, The Application of DSP 
to a Product,” Speech Tech Magazine, USA, September 1988. 


Reimer, J.B., and G.A. Frantz, “Customization of a DSP Integrated Circuit 
for a Customer Product,” Transactions on Consumer Electronics, USA, 
August 1988. 


Reimer, J.B., P.E. Nixon, E.B. Boles, and G.A. Frantz, “Audio Customiza- 
tion of a DSP IC,” Digest of Technical Papers for 1988 International Con- 
ference on Consumer Electronics, June 8-10 1988. 
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Technical Articles / Trademarks 


Trademarks 


XIV 


Medical: 


1) Knapp and Townshend, “A Real-Time Digital Signal Processing System 
for an Auditory Prosthesis,” Proceedings of ICASSP 88, USA, Volume A, 
page 2493, April 1988. 


2) Morris, L.R., and P.B. Barszczewski, “Design and Evolution of a Pocket- 
Sized DSP Speech Processing System for a Cochlear Implant and Other 
Hearing Prosthesis Applications,” Proceedings of ICASSP 88, USA, Vol- 
ume A, page 2516, April 1988. 


Development Support: 


1) Mersereau, R., R. Schafer, T. Barnwell, and D. Smith, “A Digital Filter Design 
Package for PCs and TMS320,” MIDCON/84 Electronic Show and Conven- 
tion, USA, 1984. 


2) Simar, Jr., R., and A. Davis, “The Application of High-Level Languages to 
Single-Chip Digital Signal Processors,” Proceedings of ICASSP 88, USA, 
Volume 3, pages 1678-1681, April 1988. 


HP-UX is a trademark of Hewlett-Packard Company. 
MS-DOS is a registered trademark of Microsoft Corporation. 


OS/2 and PC-DOS are trademarks of International Business Machines Corpo- 
ration. 


PAL® is a registered trademark of Advanced Micro Devices, Inc. 
Solaris and SunOS are trademarks of Sun Microsystems, Inc. 


SPARC is a trademark of SPARC International, Inc., but licensed exclusively 
to Sun Microsystems, Inc. 


Windows is a registered trademark of Microsoft Corporation. 


320 Hotline On-line, Tl, XDS510, and XDS510WS are trademarks of Texas 
Instruments Incorporated. 


Micro Star is a trademark of Texas Instruments Incorporated. 


If You Need Assistance 


If You Need Assistance... 


— World-Wide Web Sites 


TI Online http://www.ti.com 

Semiconductor Product Information Center (PIC) —_http://www.ti.com/sc/docs/pic/home.htm 
DSP Solutions http://www.ti.com/dsps 

320 Hotline On-line™ http://www.ti.com/sc/docs/dsps/support.htm 


O North America, South America, Central America 
Product Information Center (PIC) (972) 644-5580 
TI Literature Response Center U.S.A. (800) 477-8924 
Software Registration/Upgrades (214) 638-0333 Fax: (214) 638-7742 
U.S.A. Factory Repair/Hardware Upgrades (281) 274-2285 
U.S. Technical Training Organization (972) 644-5580 
DSP Hotline Email: dsph@ti.com 
DSP Internet BBS via anonymous ftp to ftp://ftp.ti.com/pub/tms320bbs 


— Europe, Middle East, Africa 
European Product Information Center (EPIC) Hotlines: 


Multi-Language Support +33 130701169 Fax: +33 1 3070 1032 
Email: epic@ti.com 
Deutsch +49 8161 80 33 11 or +33 1 30 70 11 68 
English +33 13070 1165 
Francais +33 13070 11 64 
Italiano +33 1 30 70 11 67 
EPIC Modem BBS +33 1 30 70 11 99 
European Factory Repair +33 4 93 22 25 40 
Europe Customer Training Helpline Fax: +49 81 61 80 40 10 
QO Asia-Pacific 
Literature Response Center +852 2956 7288 Fax: +852 2 956 2200 
Hong Kong DSP Hotline +852 2956 7268 Fax: +852 2 956 1002 
Korea DSP Hotline +82 2551 2804 Fax: +82 2551 2828 
Korea DSP Modem BBS +82 2 551 2914 
Singapore DSP Hotline Fax: +65 390 7179 
Taiwan DSP Hotline +886 23771450 Fax: +886 2 377 2718 
Taiwan DSP Modem BBS +886 2 376 2592 
Taiwan DSP Internet BBS via anonymous ftp to ftp://dsp.ee.tit.edu.tw/pub/T1/ 
() Japan 
Product Information Center +0120-81-0026 (in Japan) Fax: +0120-81-0036 (in Japan) 
+03-3457-0972 or (INTL) 813-3457-0972 Fax: +03-3457-1259 or (INTL) 813-3457-1259 
DSP Hotline +03-3769-8735 or (INTL) 813-3769-8735 Fax: +03-3457-7071 or (INTL) 813-3457-7071 
DSP BBS via Nifty-Serve Type “Go TIASP” 


Lj Documentation 


When making suggestions or reporting errors in documentation, please include the following information that is on the title 
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Summarizes the features of the TMS320 family of products and presents typical applications. 
Describes the TMS320C54x DSP and lists its key features. 
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Describes the TMS320C 54x memory configuration and operation. Includes memory maps and 
descriptions of program memory, data memory, and I/O space. Also includes descriptions of 
the CPU memory-mapped registers. 
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Describes the TMS320C54x CPU operations. Includes information about the arithmetic logic 
unit, the accumulators, the shifter, the multiplier/adder unit, the compare, select, store unit, and 
the exponent encoder. 
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Describes the seven basic addressing modes of the TMS320C54x. 
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Describes the TMS320C54x program control mechanisms. Includes information about ad- 
dress generation, the program counter, the hardware stack, reset, interrupts, and power-down 


modes. 
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Describes the TMS320C54x pipeline operation and lists the pipeline latency cycles for these 
types of latencies. 
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Describes the TMS320C54x peripherals and how to control them. Includes information about 
the general-purpose I/O pins, timers, clock, and host port interface. 
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Describes the TMS320C54x serial ports. Includes information about the standard serial port 
interface, buffered serial port interface, multichannel buffered serial port interface, and time-di- 
vision multiplexed serial port interface. 
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Discusses the external bus interface and the timing of events involved in memory and I/O ac- 
cesses. Describes the hold mode and the wake-up sequence from IDLE3 mode. 
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Describes the JTAG emulator cable, how to construct a 14-pin connector on your target system, 
and how to connect the target system to the emulator. 
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B Development Support and Part Order Information ............cccccceeseeeeeeeeeeee 


Provides device part numbers and support tool ordering information for the TMS320C54x and 
development support information available from TI and third-party vendors. 
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Chapter 1 


Introduction 


The TMS320C54x devices are fixed-point digital signal processors (DSPs) in 
the TMS320 family. The ‘54x meets the specific needs of real-time embedded 
applications, such as telecommunications. 


The ’54x central processing unit (CPU), with its modified Harvard architecture, 
features minimized power consumption and a high degree of parallelism. In 
addition to these features, the versatile addressing modes and instruction set 
in the 54x improve the overall system performance. 
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1.1. TMS320 Family Overview 


The TMS320 family consists of fixed-point, floating-point, and multiprocessor 
digital signal processors (DSPs). The TMS320 architecture is designed specif- 
ically for real-time signal processing. The following characteristics make this 
family the ideal choice for a wide range of processing applications: 


Very flexible instruction set 
Inherent operational flexibility 
High-speed performance 
Innovative parallel architecture 
Cost-effectiveness 

C-friendly architecture 


DOCU 


1.1.1. History, Development, and Advantages of TMS320 DSPs 


In 1982, Texas Instruments introduced the TMS3201 0 — the first fixed-point 
DSP in the TMS320 family. Before the end of the year, Electronic Products 
magazine awarded the TMS32010 the title “Product of the Year”. Today, the 
TMS320 family consists of these generations: ’C 1x, ’C2x, ’C2xx, ’C5x, ’C54x, 
and ’C6x fixed-point DSPs; ’C3x and ’C4x floating-point DSPs; and ’C8x multi- 
processor DSPs. 


Devices within a generation of the TMS320 family have the same CPU struc- 
ture but different on-chip memory and peripheral configurations. Spinoff 
devices use new combinations of on-chip memory and peripherals to satisfy 
a wide range of needs in the worldwide electronics market. By integrating 
memory and peripherals onto a single chip, TMS320 devices reduce system 
costs and save circuit board space. Figure 1-1 illustrates the performance 
gains that the TMS320 family has made. 


TMS320 Family Overview 


Figure 1—1. Evolution of the TMS320 Family 


C6000 
(‘C62x, ’C67x) 


C2000 
(’C20x, ’C24x) High performance 


, Power-efficient 
C1/2x performance 


Control optimized 


1.1.2 Typical Applications for the TMS320 Family 


Table 1—1 lists some typical applications for the TMS320 family of DSPs. The 
TMS320 DSPs offer more adaptable approaches to traditional signal-proces- 
sing problems such as vocoding and filtering than standard microprocessor/ 
microcomputer devices. They also support complex applications that often 
require multiple operations to be performed simultaneously. 
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Table 1-1. Typical Applications for the TMS320 DSPs 


Automotive 


Adaptive ride control 


Antiskid brakes 

Cellular telephones 

Digital radios 

Engine control 

Navigation and global positioning 
Vibration analysis 

Voice commands 

Anticollision radar 


General-Purpose 


Graphics/Imaging 


Adaptive filtering 
Convolution 

Correlation 

Digital filtering 

Fast Fourier transforms 
Hilbert transforms 
Waveform generation 
Windowing 


Instrumentation 


Digital filtering 


Diagnostic equipment 


Function generation 
Pattern matching 
Phase-locked loops 
Seismic processing 
Spectrum analysis 
Transient analysis 


Consumer 


Digital radios/TVs 

Educational toys 

Music synthesizers 

Pagers 

Power tools 

Radar detectors 

Solid-state answering machines 


3-D rotation 

Animation/digital maps 
Homomorphic processing 

Image compression/transmission 
Image enhancement 

Pattern recognition 

Robot vision 

Workstations 


Medical 


Fetal monitoring 
Hearing aids 

Patient monitoring 
Prosthetics 
Ultrasound equipment 


Telecommunications 


1200- to 33 600-bps modems 
Adaptive equalizers 

ADPCM transcoders 

Cellular telephones 

Channel multiplexing 

Data encryption 

Digital PBXs 

Digital speech interpolation (DSI) 
DTMF encoding/decoding 


Echo cancellation 
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Faxing 

Line repeaters 

Personal communications 
systems (PCS) 

Personal digital assistants (PDA) 

Speaker phones 

Spread spectrum communications 

Video conferencing 

X.25 packet switching 


Control 


Disk drive control 
Engine control 
Laser printer control 
Motor control 
Robotics control 
Servo control 


Industrial 


Numeric control 
Power-line monitoring 
Robotics 

Security access 


Military 


Image processing 

Missile guidance 
Navigation 

Radar processing 

Radio frequency modems 
Secure communications 
Sonar processing 


Voice/Speech 


Speaker verification 
Speech enhancement 
Speech recognition 
Speech synthesis 
Speech vocoding 
Text-to-speech 

Voice mail 
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1.2 TMS320C54x Overview 


The ’54x has a high degree of operational flexibility and speed. It combines an 
advanced modified Harvard architecture (with one program memory bus, 
three data memory buses, and four address buses), a CPU with application- 
specific hardware logic, on-chip memory, on-chip peripherals, and a highly 
specialized instruction set. Spinoff devices that combine the 54x CPU with 
customized on-chip memory and peripheral configurations have been, and 
continue to be, developed for specialized areas of the electronics market. 


The ’54x devices offer these advantages: 


L 


a 


L 


Ly 


Enhanced Harvard architecture built around one program bus, three data 
buses, and four address buses for increased performance and versatility 


Advanced CPU design with a high degree of parallelism and application- 
specific hardware logic for increased performance 


A highly specialized instruction set for faster algorithms and for optimized 
high-level language operation 


Modular architecture design for fast development of spinoff devices 


Advanced IC processing technology for increased performance and low 
power consumption 


Low power consumption and increased radiation hardness because of 
new static design techniques 
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1.3 TMS320C54x Key Features 
This section lists the key features of the 54x DSPs. 


1) CPU 


Advanced multibus architecture with one program bus, three data 
buses, and four address buses 


M 40-bit arithmetic logic unit (ALU), including a 40-bit barrel shifter and 
two independent 40-bit accumulators 

m 17-bit x 17-bit parallel multiplier coupled to a 40-bit dedicated adder 
for nonpipelined single-cycle multiply/accumulate (MAC) operation 

Hm Compare, select, store unit (CSSU) for the add/compare selection of 
the Viterbi operator 

mM Exponent encoder to compute the exponent of a 40-bit accumulator 
value in a single cycle 

m Two address generators, including eight auxiliary registers and two 
auxiliary register arithmetic units 

m Dual-CPU/core architecture on the ’5420 

Lj Memory 

mM 192K words x 16-bit addressable memory space (64K-words pro- 
gram, 64K-words data, and 64K-words I/O), with extended program 
memory in the '548, 549, 5402, 5410, and 5420. 

m On-chip configurations as follows (in K words): 


Program Program/Data 


Device ROM ROM DARAMt SARAMt 
BAI 20 8 5 0 
542 2 0 10 0 
543 B 0 10 0 
545 32 16 6 0 
546 32 16 6 0 
548 c 0 8 24 
549 16 16 8 24 
5402 4 4 16 0 
5410 16 0 8 56 
5420 0 0 32 168 


T Dual-access RAM 
+ Single-access RAM 
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(J Instruction set 


Single-instruction repeat and block repeat operations 


Block memory move instructions for better program and data 
management 


Instructions with a 32-bit long operand 

Instructions with 2- or 3-operand simultaneous reads 
Arithmetic instructions with parallel store and parallel load 
Conditional-store instructions 


Fast return from interrupt 


Lj) On-chip peripherals 


Software-programmable wait-state generator 
Programmable bank switching 


On-chip phase-locked loop (PLL) clock generator with internal oscilla- 
tor or external clock source. With the external clock source, there are 
several multiplier values available from one of the following device 
options: 


Option 1 Option2 Opition3 


1.0 1.0 Software-programmable PLLT 
1.5 4.0 
2.0 4.5 
3.0 5.0 


T The ’541B, ’545A, ’546A, ’548, '549, 5402, 5410, and '5420 have a software-pro- 
grammable PLL and two additional saturation modes. The software-programmable 
PLL is described in section 8.5.2, Software-Programmable PLL, on page 8-27. The 
saturation modes are described in section 4.1.2, Processor Mode Status Register 
(PMST), on page 4-6. 

Each device offers selection of clock modes from one option list only. 


External bus-off control to disable the external data bus, address bus, 
and control signals 


Data bus with a bus holder feature 


Programmable timer 
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m@ Ports: 
Serial Ports 
Multi- 
Host Port Channel _ Time-Division 
Device Interface Synchronous Buffered Buffered Multiplexed 
i a a a no iinn) 
542 1 0 1 0 1 
543 0 0 1 0 1 
545 1 1 1 0 0 
546 0 1 1 0 0 
548 1 0 2 0 1 
549 1 0 2 0 1 
5402 1 0 0 2 0 
5410 1 0 0 3 0 
5420 1 0 0 6 0 


1) Speed: 25/20/15/12.5/10-nst execution time for a single-cycle, fixed-point 
instruction (40 MIPS/50 MIPS/66 MIPS/80 MIPS/100 MIPS): 


Device Power Supply Speed Package 
B41 5V 25 ns 100-pin TQFP 
3V/3.3V 25 ns/20 ns 100-pin TQFP 
41B  3V/3.3V 15 ns 100-pin TQFP 
542 5V 25 ns 144-pin TQFP 
3V/3.3V 25 ns/20 ns 128-pin/144-pin TQFP 
543, 3 V/3.3V 25 ns/20 ns 100-pin TQFP 
545 3: V/3.3V 25 ns/20 ns 128-pin TQFP 
‘545A 3V/3.3V 15 ns 128-pin TQFP 
‘546-38 V/3.3V 25 ns/20 ns 100-pin TQFP 
‘B46A 3V/3.3V 15 ns 100-pin TQFP 
‘548 3.38 V 20 ns/15 ns 144-pin TQFP 
‘549, 3.3 V 15 ns/12.5 ns 144-pin TQFP/144-pin 
Micro Star™ BGA 
VC549 3.3V(2.5core) 10ns 144-pin TQFP/144-pin 


Micro Star™ BGA 
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Table Continued 


Device Power Supply Speed Package 

‘vc5402 3.3V(1.8core) 10ns 144-pin TQFP/144-pin 
Micro Star™ BGA 

"VC5410 3.3V (2.5 core) 10ns 144-pin TQFP/176-pin 
Micro Star™ BGA 

'vC5420 3.3V(1.8core) 10ns 144-pin TQFP/144-pin 


Micro Star™ BGA 


[J Power 


Mm Power consumption control with IDLE 1, IDLE 2, and IDLE 3 
instructions for power-down modes 


m Control to disable the CLKOUT signal 


(J Emulation: IEEE Standard 1149.1 boundary scan logic interfaced to 
on-chip scan-based emulation logic 
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Architectural Overview 


This chapter provides an overview of the architectural structure of the ’54x, 
which comprises the central processing unit (CPU), memory, and on-chip 
peripherals. 


The ‘54x DSPs use an advanced modified Harvard architecture that maxi- 
mizes processing power with eight buses. 


Separate program and data spaces allow simultaneous access to program 
instructions and data, providing a high degree of parallelism. For example, 
three reads and one write can be performed ina single cycle. Instructions with 
parallel store and application-specific instructions fully utilize this architecture. 
In addition, data can be transferred between data and program spaces. Such 
parallelism supports a powerful set of arithmetic, logic, and bit-manipulation 
operations that can all be performed in a single machine cycle. Also, the 54x 
includes the control mechanisms to manage interrupts, repeated operations, 
and function calling. 


Figure 2-1 shows a functional block diagram of the 54x, which includes the 
principal blocks and bus structure. 
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Block Diagram 


Figure 2-1. Block Diagram of TMS320C54x Internal Hardware 
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2.1 Bus Structure 


The ’54x architecture is built around eight major 16-bit buses (four program/ 
data buses and four address buses): 


._) The program bus (PB) carries the instruction code and immediate 
operands from program memory. 


1 Three data buses (CB, DB, and EB) interconnect to various elements, 
such as the CPU, data address generation logic, program address 
generation logic, on-chip peripherals, and data memory. 


m The CB and DB carry the operands that are read from data memory. 
m The EB carries the data to be written to memory. 


(1 Four address buses (PAB, CAB, DAB, and EAB) carry the addresses 
needed for instruction execution. 


The 54x can generate up to two data-memory addresses per cycle using the 
two auxiliary register arithmetic units (ARAUO and ARAU‘1). 


The PB can carry data operands stored in program space (for instance, a coef- 
ficient table) to the multiplier and adder for multiply/accumulate operations or 
to a destination in data space for data move instructions (MVPD and READA). 
This capability, in conjunction with the feature of dual-operand read, supports 
the execution of single-cycle, 3-operand instructions such as the FIRS instruc- 
tion. 


The ’54x also has an on-chip bidirectional bus for accessing on-chip peripher- 
als. This bus is connected to DB and EB through the bus exchanger in the CPU 
interface. Accesses that use this bus can require two or more cycles for reads 
and writes, depending on the peripheral’s structure. 


Table 2-1 summarizes the buses used by various types of accesses. 
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Table 2-1. Bus Usage for Read and Write Accesses 
Address Bus 


Access Type “PAB CAB. DAB EAB PB 
Program read V V 
Program write V 

Data single read | 

Data dual read V V 

Data long (32-bit) read V(hw) (Iw) 

Data single write V 

Data read/data write V V 

Dual read/coefficient read V V V V 
Peripheral read V 

Peripheral write V 

Legend: hw high 16-bit word 


lw low 16-bit word 


Data Bus 
CB DB 
v v 
V(hw) vw) 

V 
v v 
NI 


EB 


Internal Memory Organization 


2.2 Internal Memory Organization 


The 54x memory is organized into three individually selectable spaces: pro- 
gram, data, and I/O space. All 54x devices contain both random-access 
memory (RAM) and read-only memory (ROM). Among the devices, two types 
of RAM are represented: dual-access RAM (DARAM) and single-access RAM 
(SARAM). Table 2—2 shows how much ROM, DARAM, and SARAM are avail- 
able on the different 54x devices. The ’54x also has 26 CPU registers plus 
peripheral registers that are mapped in data-memory space. The ’54x memory 
types and features are introduced in the sections following this paragraph. For 
details about configuring and using the various memory blocks, see Chapter 
3, Memory. 


Table 2-2. Program and Data Memory on the TMS320C54x Devices 


Memory 
Type 
ROM: 


Program 


Program/ 
data 


DARAMt 


SARAMt 


541 


28K 
20K 
8K 


5K 
0 


542 543 545 546 548 549 5402. °5410 = °5 420 


2K 2K 48K 48K 2K 16K 4K 16K 0 
2K 2K 32K 32K 2K 16K 4K 16K 0 
0 0 16K 16K 0 16K 4K 0 0 
10K 10K 6K 6K 8K 8K 16K 8K 32K 
0 0 0 0 24K 24K 0 56K 168K 


T You can configure the dual-access RAM (DARAM) and single-access RAM (SARAM) as data memory or program/data 


memory. 


2.2.1 On-Chip ROM 


The on-chip ROM is part of the program memory space and, in some cases, 
part of the data memory space. The amount of on-chip ROM available on each 
device varies, as shown in Table 2-2. 


On devices with a small amount of ROM (2K words), the ROM contains a boot- 
loader that is useful for booting to faster on-chip or external RAM. For boot- 
loading details on all ’54x devices except the 548 and’549, see TMS320C54x 
DSP Reference Set, Volume 4: Applications Guide. For bootloading details on 
the 548 and 549, see TMS320C548/549 Bootloader Technical Reference. 


On devices with larger amounts of ROM, a portion of the ROM may be mapped 
into both data and program space (except the 5410). The larger ROMs are 
also custom ROMs: you provide the code or data to be programmed into the 
ROM in object file format, and Texas Instruments generates the appropriate 
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process mask to program the ROM. For details on submitting ROM codes to 
Texas Instruments, see Appendix C, Submitting ROM Codes to TI. 


2.2.2 On-Chip Dual-Access RAM (DARAM) 


The DARAM is composed of several blocks. Because each DARAM block can 
be accessed twice per machine cycle, the central processing unit (CPU) and 
peripherals such as the buffered serial port (BSP) and host port interface (HPI) 
can read from and write to a DARAM memory address in the same cycle. The 
DARAM is always mapped in data space and is primarily intended to store data 
values. It can also be mapped into program space and used to store program 
code. 


2.2.3 On-Chip Single-Access RAM (SARAM) 


The SARAM is composed of several blocks. Each block is accessible once per 
machine cycle for either a read or a write. The SARAM is always mapped in 
data space and is primarily intended to store data values. It can also be 
mapped into program space and used to store program code. 


2.2.4 On-Chip Memory Security 


The 54x maskable memory security option protects the contents of on-chip 
memories. When you designate this option, no externally originating instruc- 
tion can access the on-chip memory spaces. 


2.2.5 Memory-Mapped Registers 


The data memory space contains memory-mapped registers for the CPU and 
the on-chip peripherals. These registers are located on data page 0, simplify- 
ing access to them. The memory-mapped access provides a convenient way 
to save and restore the registers for context switches and to transfer informa- 
tion between the accumulators and the other registers. 
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2.3 Central Processing Unit (CPU) 
The ‘54x CPU is common to all the 54x devices. The 54x CPU contains: 


40-bit arithmetic logic unit (ALU) 

Two 40-bit accumulators 

Barrel shifter 

17 x 17-bit multiplier 

40-bit adder 

Compare, select, and store unit (CSSU) 
Data address generation unit 

Program address generation unit 
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2.3.1 Arithmetic Logic Unit (ALU) 


The ’54x performs 2s-complement arithmetic with a 40-bit arithmetic logic unit 
(ALU) and two 40-bit accumulators (accumulators A and B). The ALU can also 
perform Boolean operations. The ALU uses these inputs: 


16-bit immediate value 

16-bit word from data memory 

16-bit value in the temporary register, T 
Two 16-bit words from data memory 
32-bit word from data memory 

40-bit word from either accumulator 


UOUUOUU 


The ALU can also function as two 16-bit ALUs and perform two 16-bit opera- 
tions simultaneously. See section 4.2, Arithmetic Logic Unit (ALU), on page 
4-11, for more details about ALU operation. 


2.3.2 Accumulators 


Accumulators A and B (see Figure 2—1 on page 2-2) store the output from the 
ALU or the multiplier/adder block. They can also provide a second input to the 
ALU; accumulator A can be an input to the multiplier/adder. Each accumulator 
is divided into three parts: 


[1 Guard bits (bits 39-32) 
[1 High-order word (bits 31-16) 
[_j Low-order word (bits 15-0) 


Instructions are provided for storing the guard bits, for storing the high- and the 
low-order accumulator words in data memory, and for transferring 32-bit accu- 
mulator words in or out of data memory. Also, either of the accumulators can 
be used as temporary storage for the other. See section 4.3, Accumulators A 
and B, on page 4-15, for more details about the features of these accumula- 
tors. 
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Central Processing Unit (CPU) 


2.3.3 Barrel Shifter 


The ’54x barrel shifter has a 40-bit input connected to the accumulators or to 
data memory (using CB or DB), and a 40-bit output connected to the ALU or 
to data memory (using EB). The barrel shifter can produce a left shift of 0 to 
31 bits and a right shift of 0 to 16 bits on the input data. The shift requirements 
are defined in the shift count field of the instruction, the shift count field (ASM) 
of status register ST1, or in the temporary register T (when it is designated as 
a shift count register). 


The barrel shifter and the exponent encoder normalize the values in an accu- 
mulator in a single cycle. The LSBs of the output are filled with Os, and the 
MSBs can be either zero filled or sign extended, depending on the state of the 
sign-extension mode bit (SXM) in ST1. Additional shift capabilities enable the 
processor to perform numerical scaling, bit extraction, extended arithmetic, 
and overflow prevention operations. See section 4.4, Barrel Shifter, on page 
4-19, for more details about the function and use of the shifter. See section 4.7, 
Exponent Encoder, on page 4-29, for more information about the encoder’s 
accumulator-normalizing function. 


2.3.4 Multiplier/Adder Unit 
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The multiplier/adder unit performs 17 x 17-bit 2s-complement multiplication 
with a 40-bit addition in a single instruction cycle. The multiplier/adder block 
consists of several elements: a multiplier, an adder, signed/unsigned input 
control logic, fractional control logic, a zero detector, a rounder (2s comple- 
ment), overflow/saturation logic, and a 16-bit temporary storage register (T). 
The multiplier has two inputs: one input is selected from T, a data-memory op- 
erand, or accumulator A; the other is selected from program memory, data 
memory, accumulator A, or an immediate value. 


The fast, on-chip multiplier allows the 54x to perform operations efficiently 
such as convolution, correlation, and filtering. In addition, the multiplier and 
ALU together execute multiply/accumulate (MAC) computations and ALU 
operations in parallel in a single instruction cycle. This function is used in deter- 
mining the Euclidian distance and in implementing symmetrical and LMS 
filters, which are required for complex DSP algorithms. See section 4.5, Multi- 
plier/Adder Unit, on page 4-21, for more details about the multiplier/adder unit. 


Central Processing Unit (CPU) / Data Addressing 


2.3.5 Compare, Select, and Store Unit (CSSU) 


The compare, select, and store unit (CSSU) performs maximum comparisons 
between the accumulator’s high and low word, allows both the test/control flag 
bit (TC) in status register STO and the transition register (TRN) to keep their 
transition histories, and selects the larger word in the accumulator to store into 
data memory. The CSSU also accelerates Viterbi-type butterfly computations 
with optimized on-chip hardware. See section 4.6, Compare, Select, and Store 
Unit (CSSU), on page 4-26, for more details about this unit. 


2.4 Data Addressing 


The ’54x offers seven basic data addressing modes: 


1 Immediate addressing uses the instruction to encode a fixed value. 
(41 Absolute addressing uses the instruction to encode a fixed address. 


[1 Accumulator addressing uses accumulator A to access a location in 
program memory as data. 


_j Direct addressing uses seven bits of the instruction to encode the lower 
seven bits of an address. The seven bits are used with the data page point- 
er (DP) or the stack pointer (SP) to determine the actual memory address. 


Lj Indirect addressing uses the auxiliary registers to access memory. 


1 Memory-mapped register addressing uses the memory-mapped registers 
without modifying either the current DP value or the current SP value. 


1 Stack addressing manages adding and removing items from the system 
stack. 


During the execution of instructions using direct, indirect, or memory-mapped 
register addressing, the data-address generation logic (DAGEN) computes 
the addresses of data-memory operands. For a detailed discussion of the data 
addressing modes, see Chapter 5, Data Addressing. 
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Program Memory Addressing / Pipeline Operation 


2.5 Program Memory Addressing 


Program memory is usually addressed on a ’54x device with the program 
counter (PC). With some instructions, however, absolute addressing may be 
used to access data items that have been stored in program memory. (Abso- 
lute addressing is described in Chapter 5, Data Addressing.) 


The PC, which is used to fetch individual instructions, is loaded by the 
program-address generation logic (PAGEN). Typically, the PAGEN incre- 
ments the PC as sequential instructions are fetched. However, the PAGEN 
may load the PC with a non-sequential value as a result of some instructions 
or other operations. Operations that cause a discontinuity include branches, 
calls, returns, conditional operations, single-instruction repeats, multiple- 
instruction repeats, reset, and interrupts. For calls and interrupts, the current 
PC is saved onto the stack, which is referenced by the stack pointer (SP). 
When the called function or interrupt service routine is finished, the PC value 
that was saved is restored from the stack via a return instruction. 


For a detailed discussion of the hardware and software factors in program 
address generation, see Chapter 6, Program Memory Addressing. 


2.6 Pipeline Operation 
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An instruction pipeline consists of a sequence of operations that occur during 
the execution of an instruction. The 54x pipeline has six levels: prefetch, fetch, 
decode, access, read, and execute. At each of the levels, an independent op- 
eration occurs. Because these operations are independent, from one to six 
instructions can be active in any given cycle, each instruction at a different 
stage of completion. Typically, the pipeline is full with a sequential set of 
instructions, each at one of the six stages. When a PC discontinuity occurs, 
such as during a branch, call, or return, one or more stages of the pipeline may 
be temporarily unused. For more details about the pipeline operation, see 
Chapter 7, Pipeline. 


On-Chip Peripherals 


2.7 On-Chip Peripherals 


All the ’54x devices have the same CPU, but different on-chip peripherals are 
connected to their CPUs. The ’54x devices have these on-chip peripheral 
options: 


General-purpose I/O pins: XF and BIO 

Timer 

Clock generator 

Host port interface 

M 8-bit standard (542, 545, 548, 549) 

M 8-bit enhanced (’5402, 5410 — see note below) 

m 16-bit enhanced (’5420 — see note below) 

Synchronous serial port (541, 545, and ’546) 

Buffered serial port (542, 543, 545, 546, ’548, and ’549) 

Multichannel buffered serial port (McBSP) (’5402, 5410, and ’5420 — see 
note below) 

Time-division multiplexed (TDM) serial port (542, ’543, 548, and ’549). 
Software-programmable wait-state generator 

Programmable bank-switching module 


UOUOU 


DOO Uo 


Note: Enhanced Peripherals 


For more detailed information on the enhanced peripherals, see volume 5 
of this reference set: TMS320C54x DSP, Enhanced Peripherals, literature 
number SPRU302. 


2.7.1 General-Purpose I/O Pins 


Each ’54x device has two general-purpose I/O pins: BIO and XF. BIO is an 
input pin that can be used to monitor the status of external devices. XF is a 
software-controlled output pin that allows you to signal external devices. See 
section 8.3, General-Purpose I/O, on page 8-20, for more details about BIO 
and XF. 


2.7.2 Software-Programmable Wait-State Generator 


The software-programmable wait-state generator extends external bus cycles 
up to seven machine cycles (14 machine cycles in the 549 , 5402, 5410, and 
5420) to interface with slower off-chip memory and I/O devices. The software 
wait-state generator is incorporated without any external hardware. For off- 
chip memory accesses, from zero to seven wait states can be specified within 
the software wait-state register (SWWSR) for each 32K-word block of program 
and data memory, and for the 64K-word block of I/O space. See section 10.3.1, 
Wait-State Generator, on page 10-5, for more details. 
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On-Chip Peripherals 


2.7.3. Programmable Bank-Switching Logic 


The programmable bank-switching logic can automatically insert one cycle 
when an access crosses memory bank boundaries inside program memory 
or data memory. One cycle can also be inserted when an access crosses from 
program memory to data memory. This extra cycle prevents bus contention by 
allowing memory devices to release the bus before other devices start driving 
the bus. The size of memory bank for bank switching is defined by the bank 
switching control register (BSCR). See section 10.3.2, Bank-Switching Logic, 
on page 10-8, for more details. 


2.7.4 Host Port Interface 


The host port interface (HPI) is a parallel port that provides an interface to a 
host processor. Information is exchanged between the ’54x and the host pro- 
cessor through 54x on-chip memory thatis accessible to both the host proces- 
sor and the ’54x. Table 2-3 identifies the HPl-equipped ’54x devices. See sec- 
tion 8.6, Host Port Interface, on page 8-37, for more details about HPI opera- 
tion. 


Table 2-3. Host Port Interfaces on the TMS320C54x Devices 


Host Port 
Interface 


Standard 8-bit 
HPI 


Enhanced 8-bit 
HPI 


Enhanced 16-bit 
HPI 


541 


0 


0 


0 


542 543 545 546 548 549 =«°5402) «=—°5410 3=—*5420 
1 0 1 0 1 1 0 0 0 
0 0 0 0 0 0 1 1 0 
0 0 0 0 0 0 0 0 1 


2.7.5 Hardware Timer 
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The ’54x features a 16-bit timing circuit with a 4-bit prescaler. The timer counter 
is decremented by 1 at every CLKOUT cycle. Each time the counter decre- 
ments to 0, atimer interrupt is generated. The timer can be stopped, restarted, 
reset, or disabled by specific status bits. See section 8.4, Timer, on page 8-21, 
for more details. 


On-Chip Peripherals 


2.7.6 Clock Generator 


The clock generator consists of an internal oscillator and a phase-locked loop 
(PLL) circuit. The clock generator can be driven internally by a crystal resona- 
tor with the internal oscillator or externally by a clock source. The PLL circuit 
can generate an internal CPU clock by multiplying the clock source by a specif- 
ic factor; thus, you should use a clock source with a lower frequency than that 
of the CPU. For more details about the generator, see section 8.5, Clock 
Generator, on page 8-26. 
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2.8 Serial Ports 


The serial ports on the ’54x vary by device, and are represented by four types: 
synchronous, buffered, multichannel buffered (McBSP), and time-division 
multiplexed (TDM). See Table 2—4 for the number of each type on the various 
54x devices. The sections that follow provide an introduction to the four types 
of serial ports. For more details about these ports, see Chapter 9, Serial Ports. 
For detailed information about the McBSPs, see volume 5 of this reference set: 
TMS320C54x DSP, Enhanced Peripherals, literature number SPRU302. 


Table 2-4. Serial Port Interfaces on the TMS320C54x Devices 


Serial 


541 


542 543 545 546 548 549 5402. «5410 = 5420 


Ports 


Synchronous 
Buffered 


Multichannel 
Buffered 


TDM 


0 0 1 1 0 0 0 0 0 
1 1 1 1 2 2 0 0 0 
0 0 0 0 0 0 2 3 6 
1 1 0 0 1 1 0 0 0 


2.8.1. Synchronous Serial Ports 


Synchronous serial ports are high-speed, full-duplexed serial ports that pro- 
vide direct communication with serial devices such as codecs, analog-to-digi- 
tal (A/D) converters, and other serial systems. When more than one synchro- 
nous serial port resides on a 54x, these ports are identical but independent. 
Each synchronous serial port can operate at up to one-fourth the machine 
cycle rate (CLKOUT). The synchronous serial port transmitter and receiver are 
double buffered and individually controlled by maskable external interrupt sig- 
nals. Data is framed either as bytes or as words. 


2.8.2 Buffered Serial Ports 
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A buffered serial port (BSP) is a synchronous serial port that is enhanced with 
an autobuffering unit and is clocked at the full CLKOUT rate. It is full-duplexed 
and double-buffered to offer flexible data stream length. The autobuffering unit 
supports high-speed transfers and reduces the overhead of servicing interrupts. 


Serial Ports 


2.8.3 Multichannel Buffered Serial Ports (McBSPs) 


The McBSP is an enhanced buffered serial port that includes the following 
standard features: buffered data registers, full duplex communication, and in- 
dependent clocking and framing for receive and transmit. In addition, the 
McBSP includes the following enhanced features: internal programmable 
clock and frame generation, multichannel mode, and general purpose I/O. For 
detailed information about the McBSPs, see volume 5 of this reference set: 
TMS320C54x DSP, Enhanced Peripherals, literature number SPRU302. 


2.8.4 TDM Serial Ports 


A time-division multiplexed (TDM) serial port is a synchronous serial port that 
is enhanced to allow time-division multiplexing of the data. It can be configured 
for either synchronous operations or for TDM operations and is commonly 
used in multiprocessor applications. 
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2.9 External Bus Interface 


The ’54x can address up to 64K words of data memory, 64K words of program 
memory (8M words in the 548, 549, and ’5410; 1M words in the 5402; 256K 
words in the 5420), and up to 64K words of 16-bit parallel I/O ports. Accesses 
to either external memory or I/O ports take place through the external inter- 
face. Individual space-select signals, DS, PS, and IS, allow the selection of 
physically separate spaces. 


The interface’s external ready input signal and software-generated wait states 
allow the processor to interface with memory and I/O devices of many different 
speeds. The interface’s hold modes allow an external device to take control 
of the ’54x buses; in this way, an external device can access the resources in 
the program, data, and I/O spaces. 


External memory can be accessed by most ’54x instructions. However, acces- 
sing I/O ports requires the use of special instructions: PORTR and PORTW. 


See Chapter 10, External Bus Operation, for more details about interfacing the 
54x to external devices. 


2.10 IEEE Standard 1149.1 Scanning Logic 


2-16 


The IEEE Standard 1149.1 scanning-logic circuitry is used for emulation and 
testing purposes only. This logic provides the boundary scan to and from the 
interfacing devices. Also, it can be used to test pin-to-pin continuity as well as 
to perform operational tests on devices peripheral to the ‘54x. The IEEE Stan- 
dard 1149.1 scanning logic is interfaced to internal scanning-logic circuitry that 
has access to all of the on-chip resources. Thus, the ‘54x can perform 
on-board emulation using the IEEE Standard 1149.1 serial scan pins and the 
emulation-dedicated pins. For more information, see the appendix titled De- 
sign Considerations for Using XDS510 Emulator. 


Chapter 3 


Memory 


This chapter describes the 54x memory configuration and operation. In gener- 
al, the 54x devices have a total memory space of 192K 16-bit words. This 
space is divided into three specific memory segments: 64K words of program, 
64K words of data, and 64K words of I/O. In some 54x devices, the memory 
structure has been modified through overlay and paging schemes to allow 
additional memory space. 


The parallel nature of the ’54x architecture and the dual-access capability of 
the on-chip RAM allow the ’54x to perform four concurrent memory operations 
in any given machine cycle: an instruction fetch, two-operand reads, and an 
operand write. 


There are several advantages of operating from on-chip memory: 


_j Higher performance because no wait states are required 
_j Lower cost than external memory 
1 Lower power than external memory 


The main advantage of operating from off-chip memory is the ability to access 
a larger memory space. 
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Memory Space 


3.1 Memory Space 
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The '54x’s memory is organized into three individually selectable spaces: 
program, data, and I/O. Within any of these spaces, RAM, ROM, EPROM, 
EEPROM, or memory-mapped peripherals can reside either on- or off-chip. 


The program memory space contains the instructions to execute, as well as 
tables used in execution. The data-memory space stores data used by instruc- 
tions. The I/O memory space interfaces to external memory-mapped peripher- 
als and can also serve as extra data storage space. 


Depending on the chip version, several on-chip memory types are available 
on the ’54x: dual-access RAM (DARAM), single-access RAM (SARAM), and 
ROM. The RAMs are always mapped into data space, but may also be 
mapped into program space. The ROM may be activated and mapped into 
program space; it can also be mapped, in part, into data space. 


There are three CPU status register bits that affect memory configuration. The 
effects of these bits are device-specific. 


The MP/MC, OVLY, and DROM bits are located in the processor mode status 
register (PMST). For more details, see section 4.1, CPU Status and Control 
Registers, on page 4-2. 


Figure 3-1 through Figure 3—4 show the ’54x device’s data and program 
memory maps and how the maps are affected by the MP/MC, OVLY, and 
DROM bits. 


Figure 3—1. Memory Maps for the 541 


"541 Program Memory 


0000h 0000h-13FFh External 
0000h-—007Fh Reserved 
0080h-13FFh On-chip DARAM 


2000h 


4000h 


1400h-8FFFh External 


6000h 


8000h 


Ao00h 


9000h-FF7Fh On-chip ROM 


FF80h—-FFFFh Interrupt vectors 


Conn (internal) 


9000h-FF7Fh External 


FF80h—-FFFFh Interrupt vectors 
(external) 


E000h 


FFFFh 


0000h 


2000h 


4000h 


6000h 


8000h 


A0o0h 


Ccoo0h 


E000h 


FFFFh 


Memory Space 


*641 Data Memory 


0000h—005Fh Memory-mapped registers 
0060h-007Fh Scratch-pad DARAM 
0080h-13FFh On-chip DARAM 


1400h—DFFFh_ External 


DROM=0 EOOOh-FFFFh External 


DROM=1 EOOOh-FEFFh On-chip ROM 
FFOOh-FFFFh Reserved 


Memory 3-3 


Memory Space 


Figure 3-2. Memory Maps for the ‘542 and ’543 


"542/543 Program Memory 542/543 Data Memory 


0000h 0000h 
0000h-—27FFh External 


0000h-007Fh Reserved 
0080h—27FFh On-chip DARAM 


0000h-005Fh Memory-mapped registers 
0060h-007Fh Scratch-pad DARAM 
0080h-27FFh On-chip DARAM 


2000h 2000h 


4000h 4000h 


6000h 6000h 


8000h 8000h 


2800h—EFFFh External 


2800h—FFFFh_ External 


A000h A000h 


Ccoo0h 


Co00h 


E000h E000h 


MP/MC = 0 FOOOh-F7FFh Reserved 
F800h-FF7Fh On-chip ROM 
FF80h—-FFFFh Interrupt vectors 


MP/MG =1 FOOOh-FF7Fh External 
FF80h—-FFFFh Interrupt vectors 


FFFFh FFFFh 
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Figure 3-3. Memory Maps for the ‘545 and ’546 


545/546 Program Memory "545/546 Data Memory 


0000h 0000h 


0000h-005Fh Memory-mapped registers 
0060h-007Fh Scratch-pad DARAM 
0080h-17FFh On-chip DARAM 


OVLY=0 0000h-17FFh External 


OVLY = 1 0000h-007Fh Reserved 
0080h-17FFh On-chip DARAM 


2000h 2000h 


1800h—3FFFh External 


4000h 4000h 


6000h 6000h 


1800h—BFFFh_ External 


8000h 8000h 


MP/MC =0 4000h-FF7Fh On-chip ROM 


FF80h—-FFFFh_ Interrupts 
(internal) 


Ao00h A000h 


4000h—FF7Fh External 


FF80h-FFFFh_ Interrupts 
(external) 


Co00h Cco00h 


DROM=0 COOOh-FFFFh External 


E000h E000h DROM=1 COO0h-FEFFh On-chip ROM 


FFOOh-FFFFh Reserved 


FFFFh FFFFh 


Memory 3-5 


Memory Space 


Figure 3-4. Memory Maps for the ‘548 


Hex Program 


one Reserved (OVLY = 1) 


or 
External (OVLY = 0) 


007F 


oo“ On-Chip DARAM 


(OVLY =1) 
or 


External (OVLY =0) 
1FFF 


eng On-Chip SARAM 


(OVLY =1) 
or 


External (OVLY =0) 


7FFF 


8000 


External 


FF7F 


FF80 Interrupts and 
Reserved 
(External) 

FFFF 


MP/MC = 1 


(Microprocessor Mode) 
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Hex 
0000 


007F 
0080 


1FFF 
2000 


7FFF 
8000 


EFFF 
F000 


F7FF 
F800 


FF7F 
FF80 


FFFF 


Program 


Reserved (OVLY = 1) 
or 


External (OVLY = 0) 


On-Chip DARAM 
(OVLY = 1) 
or 


External (OVLY = 0) 


On-Chip SARAM 
(OVLY = 1) 
or 


External (OVLY = 0) 


External 


Reserved 


On-Chip ROM 
(2K Words) 


Interrupts and 
Reserved 
(On-Chip) 


MP/MC = 0 


(Microcomputer Mode) 


Hex 
0000 


005F 
0060 


007F 
0080 


1FFF 
2000 


7FFF 
8000 


FFFF 


Data 


Memory-Mapped 
Registers 


Scratch-Pad RAM 


On-Chip DARAM 
(8K Words) 


On-Chip SARAM 
(24K Words) 


External 


Figure 3—5. Memory Maps for the ‘549 


Hex Program 


a Reserved (OVLY = 1) 


or 
External (OVLY =0) 


007F 


ose On-Chip DARAM 


(OVLY =1) 
or 


External (OVLY =0) 
1FFF 


e000 On-Chip SARAM 


(OVLY = 1) 
or 


External (OVLY =0) 


7FFF 


8000 


External 


FF7F 


FF80 Interrupts and 
Reserved 
(External) 

FFFF 


MP/MC = 1 
(Microprocessor Mode) 


Hex 
0000 


007F 
0080 


1FFF 
2000 


7FFF 
8000 


BFFF 
coo 


FEFF 
FFOO 


FFFF 


Program 


Reserved (OVLY = 1) 
or 


External (OVLY = 0) 


On-Chip DARAM 
(OVLY =1) 
or 


External (OVLY = 0) 


On-Chip SARAM 
(OVLY =1) 
or 


External (OVLY = 0) 


External 


On-Chip ROM 
(16K Words) 


Interrupts and 
Reserved 
(On-Chip) 


MP/MC= 0 


(Microcomputer Mode) 


Hex 
0000 


O05F 
0060 


007F 
0080 


1FFF 
2000 


7FFF 
8000 


BFFF 
coo 


FEFF 
FFOO 


FFFF 


Memory Space 


Data 


Memory-Mapped 
Registers 


Scratch-Pad RAM 


On-Chip DARAM 
(8K Words) 


On-Chip SARAM 
(24K Words) 


External 


On-Chip ROM (DROM = 1) 
or 
External (DROM = 0) 


Reserved (DROM = 1) 
or 
External (DROM = 0) 


Memory 3-7 


Memory Space 


Figure 3—6. Extended Program Memory Maps for the ‘548 and ’549 


xx 0000 010000 FT 020000 FT 7F 0000 FF 
e e e 
Page 0 | Page 1 | Page 2 | | Page 127 | 
32K 32K 32K 32K 
Wordst | Words < | Words | | Wordst - 
_ 2 _ ae _ 2 
xx 7FFF 01 FFFF ___ 02 FFFF a, __ 7F FFFF ___ 
00 8000 01 8000 02 8000 7F 8000 
e e e 
Page 127 
32K 
Words 
e e e 
00 FFFF 01 FFFF 02 FFFF 7F FFFF 
XPC =0 XPC =1 XPC =2 XPC = 127 


t See Figure 3-4 and Figure 3-5 for more information about this on-chip memory region. 
+ These pages available when OVLY = 0 when on-chip RAM is not mapped in program space or data space. When OVLY = 1 
the first 32K words are all on page 0 when on-chip RAM is mapped in program space or data space. 
NOTE: When the on-chip RAM is enabled in program space, all accesses to the region xx 0000 — xx 7FFF, regardless of page 
number, are mapped to the on-chip RAM at 00 0000 — 00 7FFF. 
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Figure 3—7. Memory Maps for the ‘5402 


Hex Page 0 Program 
0000 


Reserved 


(OVLY = 1) 


External 
(OVLY = 0) 


007F 
0080 


Dn-Chip DARAM (OVLY 
1) 


External 
(OVLY = 0) 


3FFF 
4000 


External 


FF7F 
FF80 
Interrupts 
(External) 


FFFF 


MP/MC = 1 
(Microprocessor Mode) 


Hex Page 0 Program 


0000 
Reserved 
(OVLY = 1) 


External 
(OVLY = 0) 


007F 
0080 


@n-Chip DARAM (OVLY 
1) 


External 
(OVLY = 0) 


3FFF 
4000 


EFFF 
F000 
On-Chip ROM 
(4K x 16-bits) 
FEFF 
FFOO 
Reserved 
FF7F 


FF80 
Interrupts 
(On-Chip) 


FFFF 


MP/MC= 0 
(Microcomputer Mode) 


Memory Space 


Hex Data 


0000 Memory Mapped 
Registers 


005F 


0060 Scratch-Pad 
RAM 


007F 
0080 


On-Chip DARAM 
(16K x 16-bits) 


3FFF 
4000 


EFFF 
F000 


ROM (DROM=1) or Ex- 
ternal (DROM=0) 


FEFF 
FFOO 


Reserved (DROM=1) 
or External (DROM=0) 


FFFF 
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Memory Space 


Figure 3-8. Extended Program Memory for the ‘5402 


00 0000 1 0000 2.0000 
Page 1 
Lower 
32Kt 
External 
Page 0 
1 7FFF 2 7FFF 
64K 18000 2 8000 
Wordst 
Page 1 
Upper 
32K 
External 
0 FFFF 1 FFFF 2 FFFF 


t See DMA memory map. 


Page 2 
Lower 
32Kt 

External 


Page 2 
Upper 
32K 
External 


F 0000 


F 7FFF 
F 8000 


F FFFF 


Page 15 
Lower 
32Kt 

External 


Page 15 
Upper 
32K 
External 


+ The lower 32K words of pages 1 through 15 are available only when the OVLY bit is cleared to 0. If the OVLY bit is set to 1, the 


on-chip RAM is mapped to the lower 32K words of all program space pages. 


3-10 


Figure 3-9. Memory Maps for the 5410 
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(External) 


Page 0 


Hex Program 


010000 


Mapped to 
Lower Page 0 
(OVLY = 1) 


External 
(OVLY = 0) 


017FFF 
018000 


External 


01FFFF 


Page 1 


MP/MC= 1 


(Microprocessor Mode) 


Hex 
0000 


007F 
0080 


1FFF 
2000 


7FFF 
8000 


BFFF 
coo 


FF7F 
FF80 


FFFF 


Program 


Reserved 
(OVLY = 1) 


External 
(OVLY = 0) 


On-Chip 
DARAM 
(OVLY = 1) 


External 
(OVLY = 0) 


On-Chip 
SARAM1 
(OVLY = 1) 
External 
(OVLY = 0) 


External 


(16K Words) 


Interrupts and 
Reserved 
(On-Chip ROM) 


Page 0 


Hex Program 


010000 


Mapped to 


Lower Page 0 
(OVLY = 1) 


External 
(OVLY = 0) 


017FFF 
018000 


On-Chip 
SARAM2 


01FFFF 


Page 1 


MP/MC= 0 


(Microcomputer Mode) 


Hex 
0000 


005F 
0060 


007F 
0080 


1FFF 
2000 


7FFF 
8000 


FFFF 


Memory 


Memory Space 


Data 


Memory-Mapped 
Registers 


Scratch- 
Pad RAM 


On-Chip 
DARAM 
(8K Words) 


On-Chip 
SARAM1 
(24K Words) 


On-Chip 
SARAM2 
(DROM = 1) 
External 
(DROM = 0) 
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Memory Space 


Figure 3-10. Extended Program Memory Maps for the ‘5410 
(On-chip RAM Not Mapped in Program Space and Data Space, OVLY = 0) 


00 0000 01 0000 02 0000 7F 0000 
Page 127 
64K 
Words 
00 FFFF 01 FFFF 02 FFFF 7F FFFF 
XPC =0 XPC =1 XPC =2 XPC=127 


Figure 3—11.Extended Program Memory Maps for the ‘5410 
(On-chip RAM Mapped in Program Space and Data Space, OVLY = 1) 


xx 0000 
XX 7FFF 
XPC = xx 
00 8000 01 8000 02 8000 7F 8000 
Page 0 Page 2 Page 127 
32K 32K 32K 
Words Ex- Words Words 
ternal External External 
00 FFFF 01 FFFF 02 FFFF 7F FFFF 
XPC =0 XPC =1 XPC =2 XPC=127 


t See Figure 3-9 for more information about this on-chip memory region. 
NOTE: When the on-chip RAM is enabled in program space, all accesses to the region xx 0000 — xx 7FFF, regardless of page 


number, are mapped to the on-chip RAM at 00 0000 — 00 7FFF. 
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Memory Space 


Figure 3-12. Data Memory Map for the ‘5420 Relative to CPU Subsystems A and B 


Hex Data 
0000 Memory 
Mapped Registers 
se 
0060 Scratch—Pad 
007F DARAM 
puso On-Chip 
DARAM 0 
(16k Words) 
3FFF 
4000 


On-Chip 
SARAM 1 


(16k Words) 


7FFF 
8000 


On-Chip 
SARAM 2 
(32k Words) 
Prog/Data 
(DROM=1) 
External 
(DROM=0) 


FFFF 
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Memory Space 


Figure 3—13. Program Memory Maps for the ‘5420 Relative to CPU Subsystems A and B 


Hex Program Page 0 


Hex Program Page 1 


Hex Program Page 2 


Hex Program Page 3 


vo 


0000 10000 20000 30000 
On-Chip On-Chip On-Chip On-Chip 
DARAM 0 DARAM 0 DARAM 0 DARAM 0 
(16k Words) (16k Words) (16k Words) (16k Words) 
Prog/Data Prog/Data Prog/Data Prog/Data 
(OVLY=1) (OVLY=1) (OVLY=1) (OVLY=1) 
External External External External 
(OVLY=0) (OVLY=0) (OVLY=0) (OVLY=0) 
(EMIF)t (EMIF)t (EMIF)t (EMIF)t 
3FFF 13FFF 23FFF 33FFF 
4000 14000 24000 34000 
On-Chip On-Chip On-Chip On-Chip 
SARAM 1 SARAM 1 SARAM 1 SARAM 1 
(16k Words) (16k Words) (16k Words) (16k Words) 
Prog/Data Prog/Data Prog/Data Prog/Data 64k 
(OVL=1) (OVL=1) (OVL=1) (OVL=1) Esterial 
V/O Ports 
External External External External (EMIF)t 
(OVLY=0) (OVLY=0) (OVLY=0) (OVLY=0) 
(EMIF)t (EMIF)t (EMIF)T (EMIF)t 
7FFF 17FFF 27FFF 37FFF 
8000 18000 28000 External 38000 
(EMIF)t 
On-Chip On-Chip 2EFFF 
SARAM 2 SARAM 3 2F000 


(32k Words) 
Prog/Data 


External 
(EMIF)t 


FFFF 
(extended) 


(32k Words) 


External 
(EMIF)t 


1FFFF 
(extended) 


On-Chip 
SARAM 4 
(4k Words) 


External 
(EMIF)t 


2FFFF 
(extended) 


External 
(EMIF)tT 


3FFFF 
(extended) 


t EMIF (external memory) mode is required for all external accesses. EMIF mode is when XIO pin = 1 and the MP/MC bit is 1. 
A. OVLY =1 overlays the data page and all program pages between addresses 0x0000—0x7FFF. 
B. DROM = 1 overlays 0x8000-—OxFFFF of program and data memory. 
C. Allinternal memory is divided into 8K blocks with the exception of the 4K W block on P2 (0x2F000-0x2F FFF). 
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Program Memory 


3.2 Program Memory 


The external program memory on the ’54x devices (except on the 548, ’549, 
5402, 5410, and 5420) addresses up to 64K 16-bit words. ’54x devices have 
on-chip ROM, dual-access RAM (DARAM), and single-access RAM (SARAM) 
that can be mapped by software into the program space. When the cells are 
mapped into program space, the device automatically accesses them when 
addresses fall within their bounds. When the program address generation unit 
(PAGEN) generates an address outside of the bounds of on-chip memory, the 
device automatically generates an external access. (For more information 
about program address generation, see Chapter 6, Program Memory Addres- 
sing.) Table 3-1 shows the on-chip program memory available on the various 
54x devices. 


Table 3—1. On-Chip Program Memory Available on ’54x Devices 


Device ROM DARAM SARAM 
‘341 28K 5K = 
342 2K 10K = 
543 2K 10K a 
B45 48K 6K - 
‘546 48K 6K 2 
548 2K 8K 24K 
549 16K 8K 24K 

5402 4K 16K - 
5410 16K 8K 56K 
5420 7 32K 168K 
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Program Memory 


3.2.1. Program Memory Configurability 
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The MP/MC and OVLY bits determine which on-chip memories are enabled 
in program space. 


At reset, the logic level present on the MP/MC pin is transferred to the MP/MC 
bit in the PMST register (see section 4.1, CPU Status and Control Registers, 
on page 4-2). The MP/MC bit determines whether to enable the on-chip ROM. 
If MP/MC = 1, the device is configured as a microprocessor, and the on-chip 
ROM is not enabled. If MP/MC pin = 0, the device is configured as a microcom- 
puter, and the on-chip ROM is enabled. The MP/MC pin is sampled only at 
reset; however, you can disable or enable the on-chip ROM through software 
by setting or clearing the MP/MC bit in the PMST register. 


Figure 3-1 through Figure 3-4 (pages 3-3 through 3-6) show the program 
memory configurations on the individual ‘54x devices. 


Program Memory 


3.2.2 On-Chip ROM Organization 


The on-chip ROM is subdivided and organized in blocks to enhance perfor- 
mance. For example, the block organization enables you to fetch an instruc- 
tion from one block of ROM without sacrificing data accesses that come from 
a different block of ROM. Figure 3-14 shows the way the ROM is organized 
in blocks for each 54x device. The gray lines in the figure indicate block bound- 
aries. 


Figure 3-14. On-Chip ROM Block Organization 


541 542/543 545/546t 548 549 5402 5410 


4000h 

4000 — 4FFF 
5000h 

5000 — 5FFF 
6000h 

6000 — 6FFF 
7000h 

7000 — 7FFF 
8000h 

8000 — 8FFF 
9000h [9000 - 97FF 

9000 — 9FFF 


9800 — 9FFF 
A0Q00h 

A000 — AFFF A000 — AFFF 
BO00 — BFFF 


Boo0oh 
BO00 — BFFF 
coo0h 
C000 — CFFF C000 — CFFF 
DOo00h 
D000 — DFFF D000 — DFFF 
E000 — EFFF 


E000h 
£000 — EFFF 
F000 — FFFF 
F7FF — FFFF F800 — FFFF 


F000h 
tT ROM is organized in 8K blocks on these devices. 


C000 — DFFF 


C000 — DFFF 


E000 — FFFF 


E000 — FFFF 
F000 — FFFF 


F000 — FFFF 
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Program Memory 


3.2.3. Program Memory Address Map and On-Chip ROM Contents 


At device reset, the reset, interrupt, and trap vectors are mapped to address 
FF80h in program space. However, these vectors can be remapped to the 
beginning of any 128-word page in program space after device reset. This 
feature facilitates moving the vector table out of the boot ROM and then 
removing the ROM from the memory map. For details on remapping the 
vectors, see section 6.10.9, Remapping Interrupt-Vector Addresses, on page 
6-36. 


Te, 
Note: 


In the on-chip ROM, 128 words are reserved for device-testing purposes. 
Application code written to be implemented in on-chip ROM must reserve 
these 128 words at addresses FFOOh-FF7Fh in program space. 


| 


3.2.4 On-Chip ROM Code Contents and Mapping 
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The '54x devices (except 5420) provide a variety of ROM sizes (4K, 16K, 24K, 
28K, or 48K words). On ’54x devices with on-chip bootloader ROM, the 2K 
words (at F800h to FFFFh) may contain one or more of the following, depend- 
ing on the specific device: 


_j A bootloader program that boots from the serial ports, external memory, 
an I/O port, or the host port interface (if present) 


A 256-word u-law expansion table 
A 256-word A-law expansion table 


A 256-word sine look-up table 


Do eo 


An interrupt vector table 


Figure 3-15 shows which of these items are on a particular 54x device and 
shows the addresses of each of the items. The address range for the code, 
F800h—FFFFh, is mapped to the on-chip ROM if the MP/MC bit is 0. 


Note: 


You can submit code to Texas Instruments in object file format to program 
into the ‘54x on-chip ROM. See Appendix C, Submitting ROM Codes to TI, 


for details on how to submit ROM code to Texas Instruments. 
| ee 


Program Memory 


Figure 3-15. On-Chip ROM Program Memory Map (High Addresses) 


*B41/545/546 *542/543/548/549/5402/5410 
F800h User-specified code Bootloader code 
F900h 
FAOOh | 
FBOOh | 
FCOOh | u-law expansion table 
FDOOh | A-law expansion table 
FEOOh | | Sine look-up table 
FFOOh | Reserved | Reserved 


FF80h | Interrupt vector table | | Interrupt vector table 
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3.2.5 Extended Program Memory (Available on °548/549/5402/5410/5420) 


The '548, 549, 5402, 5410, and ‘5420 use a paged extended memory 
scheme in program space to allow access of up to 8192K words of program 
memory. To implement this scheme, the 548, 549, 5402, 5410, and 5420 in- 
clude several additional features: 


[Jj 23 address lines, instead of 16 (20 address lines in the ’5402, and 18 in 
the 5420) 


[1 An extra memory-mapped register, the program counter extension 
register (XPC) 


Lj Six extra instructions for addressing extended program space 


Program memory in the ’548, 549, 5402, 5410, and 5420 is organized into 
128 pages (16 pages in the 5402, and 4 in the 5420) that are each 64K words 
in length, as shown in Figure 3-16. 


Figure 3—16. Extended Program Memory With On-Chip RAM Not Mapped in Program 
Space (OVLY = 0) 


00 0000 


00 FFFF 
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XPC =0 


01 0000 02 0000 7F 0000 


Page 127 


64K 
words 


01 FFFF 02 FFFF 7EF FFFF 


XPC = 1 XPC = 2 XPC=127 


When the on-chip RAM is enabled in program space, each page of program 
memory is made up of two parts: acommon block of 32K words maximum and 
a unique block of 32K words. The common block is shared by all pages and 
each unique block is accessible only through its assigned page. Figure 3-17 
shows the common and unique blocks. 


If the on-chip ROM is enabled (MP/MC = 0), it is enabled only on page 0. Itis 
not mapped to any other page in program memory. 


The value of the XPC register defines the page selection. This register is 
memory-mapped into data space to address 001Eh. At a hardware reset, the 
XPC is initialized to 0. 


Program Memory 


Figure 3—17. Extended Program Memory With On-Chip RAM Mapped in Program Space 


XX 


XX 


00 


00 


and Data Space (OVLY = 1) 


0000 


JEFF 


XPC = xx 


8000 01 8000 02 8000 71F 8000 
Page 127 
32K 
words 
FFFF 01 FFFF 02 FFFF TF FFFF 
XPC = 0 XPC = 1 XPC =2 XPC=127 


Note: When the on-chip RAM is enabled in program space, all accesses to the region xx 0000 — xx 7FFF, regardless of page 


number, are mapped to the on-chip RAM at 00 0000 — 00 7FFF. 


t See Figure 3-4 on page 3-6 for more information about this on-chip memory region. 


To facilitate page switching through software, the 548, ’549, 5402, 5410, and 
5420 have six special instructions that affect the XPC: 


(j FB[D] - Far branch 


[1 FBACC[D]-Far branch to the location specified by the value in accumula- 
tor A or accumulator B 


[1 FCALA|D]- Far call to the location specified by the value in accumulator 
A or accumulator B 


(J FCALL[D] - Far call 
_j FRET[D] - Far return 
Lj FRETE[D] - Far return with interrupts enabled 


The following two '54x instructions are extended in the 548, ’549, ’5402, 5410, 
and ’5420 to use 23 bits (20 bits in the 5402, and 18 in the 5420): 


_} READA- Read program memory addressed by accumulator A and store 
in data memory 


1 WRITA — Write data to program memory addressed by accumulator A 


All other instructions do not modify the XPC and access only memory within 
the current page. 
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3.3. Data Memory 


The data memory on the ’54x contains up to 64K 16-bit words. A number of 
the ’54x devices have on-chip ROM that can be mapped by software into the 
data space (DROM) in addition to any dual- and single-access RAM (DARAM 
and SARAM). Table 3—2 shows the on-chip data memory available on various 
54x devices. 


Table 3-2. On-Chip Data Memory Available on the TMS320C54x Devices 


Device Program/Data ROM DARAM SARAM 
B41 8K 5K = 
542 - 10K - 
543 = 10K = 
‘545 16K 6K — 
546 16K 6K es 
‘548 - 8K 24K 
549 16K 8K 24K 
5402 4K 16K = 
5410 16K 8K 56K 
5420 - 32K 168K 


Accesses to the RAM and the data ROM (when it is enabled) are made when 
addresses fall within the bounds of the corresponding on-chip memories. 
When the data-address generation logic (DAGEN) generates an address out- 
side of the bounds of on-chip memory, the device automatically generates an 
external access. (For more information about the generation of data address- 
es, see Chapter 5, Data Addressing.) 


3.3.1. Data Memory Configurability 
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Data memory can reside both on- and off-chip. The on-chip DARAM is mapped 
into data memory space. For some ’54x devices, you can map a portion of the 
on-chip ROM (the amount shown in Table 3—2) into data space by setting the 
DROM bit located in the PMST register (see section 4.1, CPU Status and Con- 
trol Registers, on page 4-2). This portion of on-chip ROM is enabled both in 
the data space (DROM bit) and in the program space (MP/MC bit), allowing 
an instruction to use the ROM area as a data ROM residing in data space. At 
reset, the processor clears the DROM bit to 0. 


Data Memory 


The data ROM is accessed in a single cycle by an instruction using single data- 
memory operand addressing, including an instruction with a 32-bit long word 
operand. In the dual-memory operand addressing, the access requires two 
cycles if both operands reside in the same block; if the operands reside in 
different blocks, the access requires a single cycle. For the address bound- 
aries of the ROM blocks, see section 3.2.2, On-Chip ROM Organization, on 
page 3-17. 


Figure 3-1 through Figure 3—4 (pages 3-3 through 3-6) show the data 
memory configurations on the individual ‘54x devices. 


3.3.2 On-Chip RAM Organization 


On-chip RAM is subdivided and organized in blocks to enhance performance. 
For example, the block organization enables you to fetch two operands from 
one block of DARAM and write to another block of DARAM in the same cycle. 
Figure 3-18 on page 3-24 shows the RAM block organization for each ’54x 
device. The gray lines in the figure indicate block boundaries. 


The organization of the first 1K of DARAM on all ’54x devices includes the 
memory-mapped CPU and peripheral registers, 32 words of scratch-pad DA- 
RAM, and 896 words of DARAM. 
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Data Memory 


Figure 3—18. On-Chip RAM Block Organization 
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0000h 


1000h 


2000h 


3000h 


4000h 


5000h 


6000h 


7000h 


"BAI 
[~~ o000-03FF 
| 0400-07FF | 
| 0800-0AFF | 
| 0B00-OFFF 


| 
| 
| 
1000-13FF | 


[ Dual-access RAM 


542/543 545/546 


0000-07FF 0000-07FF 
0800-OF FF | 0800-OFFF 
1000-17FF 1000-17FF 
1800-1FFF 


2000-27FF 


= Single-access RAM 


548/549 


0000—07FF 


0800-OFFF 


1000—17FF 


1800-1 FFF 


2000-3FFF 


4000-5FFF 


6000—7FFF 


Data Memory 


Figure 3-19. On-Chip RAM Block Organization ('5402/'54 10/5420) 


5420 


5402 subsystem A or B 


0060h 0080h 


8k O07FFh 
0800h 


1FFFh OFFFh 
2000h 1000h 


8k 17FFh 
1800h 


3FFFh 1FFFh 
2000h 


3FFFh 
4000h 


5FFFh 
6000h 


7FFFh 
n8000h 


n9FFFh 
nAO0OOh 


nBFFFh 
nCoo0oh 


nDFFFh 
nE000h 


nFFFFh 


e 
e 
Dual- RAM bs 
L | ners 2F000h 
ingle— RAM Ea 
[ Single—access OFFEFh 


n=page, where n=0,1,2...127 


3.3.3. Memory-Mapped Registers 


The 64K words of data memory space include the device’s memory-mapped 
registers, which reside in data page 0 (data addresses 0000h—007Fh). Data 
page 0 consists of the following: 


Lj The CPU registers (26 total) are accessible with no wait states; see 
Table 3-3 on page 3-26. 
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Data Memory 


(J Theperipheral registers are used as control and data registers in peripher- 
al circuits. These registers reside within addresses 0020h—O05F and 
reside on a dedicated peripheral bus structure. For a list of peripherals on 
a particular ‘54x device, see section 8.2, Peripheral Memory-Mapped 
Registers, on page 8-2. 


(1 The scratch-pad RAM block (60h—7Fh in data memory) includes 32 words 
of DARAM for variable storage that helps avoid fragmenting the large 
RAM block. 


3.3.4 CPU Memory-Mapped Registers 


Table 3-3 lists the CPU memory-mapped registers. This section gives a brief 
summary of one or more of the registers. 


3.3.4.1. Interrupt Registers (IMR, IFR) 


The interrupt mask register (IMR) individually masks off specific interrupts at 
required times. The interrupt flag register (IFR) indicates the current status of 
the interrupts. Interrupts are described in detail in section 6.10, /nterrupts, on 
page 6-26. 


3.3.4.2 Status Registers (STO, ST1) 


The status registers STO and ST1 contain the status of the various conditions 
and modes for the 54x devices. STO contains the flags (OVA, OVB, C, and TC) 
produced by arithmetic operations and bit manipulations, in addition to the DP 
and the ARP fields. ST1 reflects the status of modes and instructions executed 
by the processor. See section 4.1, CPU Status and Control Registers, on 
page 4-2 for detailed information. 


Table 3-3. CPU Memory-Mapped Registers 


Address 
(Hex) Name Description 
0s IMR-—sdInterruptmaskregister CS” 
1 IFR Interrupt flag register 
2-5 - Reserved for testing 
6 STO Status register 0 
7 Silat Status register 1 
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Data Memory 


Table 3-3. CPU Memory-Mapped Registers (Continued) 


Address 
(Hex) Name Description 
8 AL Accumulator A low word (bits 15-0) 
9 AH Accumulator A high word (bits 31-16) 
A AG Accumulator A guard bits (bits 39-32) 
B BL Accumulator B low word (bits 15-0) 
C BH Accumulator B high word (bits 31-16) 
D BG Accumulator B guard bits (bits 39-32) 
E T Temporary register 
F TRN Transition register 
10 ARO Auxiliary register 0 
all AR1 Auxiliary register 1 
12 AR2 Auxiliary register 2 
13 AR3 Auxiliary register 3 
14 AR4 Auxiliary register 4 
15 AR5 Auxiliary register 5 
16 AR6 Auxiliary register 6 
17 AR7 Auxiliary register 7 
18 SP Stack pointer 
19 BK Circular-buffer size register 
1A BRC Block-repeat counter 
1B RSA Block-repeat start address 
1C REA Block-repeat end address 
1D PMST Processor mode status register 
1E XPC Program counter extension register (’548, 549, 
5402, 5410, and 5420) 
1E-1F = Reserved 


3.3.4.3 Accumulators (A, B) 


The ’54x devices have two 40-bit accumulators: accumulator A and accumula- 
tor B. Each accumulator is memory-mapped and partitioned into accumulator 
low word (AL, BL), accumulator high word (AH, BH), and accumulator guard 
bits (AG, BG). See section 4.3, Accumulators A and B, on page 4-15 for more 
details about these accumulator features. 
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3.3.4.4 Temporary Register (T) 


The temporary (T) register has many uses. For example, it may hold: 


(J One of the multiplicands for multiply and multiply/accumulate instructions 
(For more details about the T register and the processes of multiplication, 
see section 4.5, Multiplier/Adder Unit, on page 4-21.) 


[J Adynamic (execution-time programmable) shift count for instructions with 
shift operation such as the ADD, LD, and SUB instructions 


(J A dynamic bit address for the BITT instruction 


(J Branch metrics used by the DADST and DSADT instructions for ACS 
operation of Viterbi decoding 


In addition, the EXP instruction stores the exponent value computed into T reg- 
ister, andthen the NORM instruction uses the T register value to normalize the 
number. 


3.3.4.5 Transition Register (TRN) 


The 16-bit transition (TRN) register holds the transition decision for the path 
to new metrics to perform the Viterbi algorithm. The CMPS (compare select 
max and store) instruction updates the contents of TRN register on the basis 
of the comparison between the accumulator high word and the accumulator 
low word. 


3.3.4.6 Auxiliary Registers (ARO-AR7) 


The eight 16-bit auxiliary registers (ARO—AR7) can be accessed by the CPU 
and modified by the auxiliary register arithmetic units (ARAUs). The primary 
function of the auxiliary registers is to generate 16-bit addresses for data 
space. However, these registers can also act as general-purpose registers or 
counters. For information about the role the auxiliary registers play in data- 
memory addressing, see section 5.5, Indirect Addressing, on page 5-10. 


3.3.4.7 Stack-Pointer Register (SP) 


The 16-bit stack-pointer register (SP) contains the address of the top of the 
system stack. The SP always points to the last element pushed onto the stack. 
The stack is manipulated by interrupts, traps, calls, returns, and the PSHD, 
PSHM, POPD, and POPM instructions. Pushes and pops of the stack 
predecrement and postincrement, respectively, the 16-bit value in the stack 
pointer. 


3.3.4.8 Circular-Buffer Size Regisiter (BK) 
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The ARAUs use16-bit circular-buffer size register (BK) in circular addressing 
to specify the data block size. For information on BK and circular addressing, 
see section 5.5.3.4, Circular Address Modifications, on page 5-15. 


Data Memory 


3.3.4.9 Block-Repeat Registers (BRC, RSA, REA) 


The 16-bit block-repeat counter (BRC) register specifies the number of times 
a block of code is to repeat when a block repeat is performed. The 16-bit block- 
repeat start address (RSA) register contains the starting address of the block 
of program memory to be repeated. The 16-bit block-repeat end address 
(REA) register contains the ending address of the block of program memory 
to be repeated. For more information about repeating multiple instructions and 
the BRC, RSA, and REA, see section 6.8, Repeating a Block of Instructions, 
on page 6-23. 


3.3.4.10 Processor Mode Status Register (PMST) 


The processor mode status register (PMST) controls memory configurations 
of the 54x devices. The PMST is described in detail in section 4.1, CPU Status 
and Control Registers, on page 4-2. 


3.3.4.11 Program Counter Extension Register (XPC, Available on ’548/549/'5402/'54 10/5420) 


The program counter extension register (XPC) contains the upper 7 bits of the 
current program memory address. See section 3.2.5, Extended Program 
Memory on page 3-20, for more information about extended memory. 
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/O Memory / Program and Data Security 


3.4 


1/O Memory 


The '54x devices offer an I/O memory space in addition to the program- and 
data-memory spaces. The I/O memory space is a 64K-word address space 
(OQOOOh-FFFFh) and exists only external to the device. Two instructions, 
PORTR and PORTW, are used to access this space. Read timings vary from 
those of the program- and data-memory spaces to facilitate access to individu- 
al l/O-mapped devices rather than to memories. For details of external bus op- 
eration and control for I/O accesses, see Chapter 10, External Bus Operation. 


3.5 Program and Data Security 


The 54x has two staged security options: on-chip ROM security and ROM/ 
RAM security. See Table 3—4 for a summary of the security feature. 


Table 3-4. Data Security 


Security 


ROM 


ROM/RAM 


Affect On-Chip Memory Accesses 


ROM 


Affected for both program and data accesses. 


Instructions fetched from on-chip ROM may 
read on-chip ROM for program and data 
accesses. 


Instructions fetched from on-chip RAM or 
external memory will be prohibited from 
accessing on-chip ROM and will read invalid 
data (OFFFFh) on the bus for program and 
data accesses. 


Affected for both program and data accesses. 


Instructions fetched from on-chip ROM and 
on-chip RAM may read on-chip ROM for 
program and data accesses. 


Instructions fetched from external memory will 
be prohibited from accessing on-chip ROM 
and will read invalid data (OFFFFh) on the bus 
for program and data accesses. 


RAM 


Unaffected by this security option. 


Affected for both program and data accesses. 


Instructions fetched from on-chip ROM and 
on-chip RAM may read on-chip RAM for 
program and data accesses. 


Instructions fetched from external memory will 
be prohibited from accessing on-chip RAM 
and will read invalid data (OFFFFh) on the bus 
for program and data accesses. 


Chapter 4 


Central Processing Unit 


This chapter describes the 54x central processing unit (CPU) operations. The 
CPU can perform high-speed arithmetic operations within one instruction 
cycle because of its parallel architectural design. 


The following CPU functional components are discussed in this chapter: 


40-bit arithmetic logic unit (ALU) 

Two 40-bit accumulator registers 

Barrel shifter supporting a —16 to 31 shift range 
Multiply/accumulate block 

16-bit temporary register (T) 

16-bit transition register (TRN) 

Compare, select, and store unit (CSSU) 
Exponent encoder 


OOUUOUOUUUL 


The CPU registers are memory-mapped, enabling quick saves and restores. 
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4.1 CPU Status and Control Registers 
The ’54x has three status and control registers: 


Lj Status register 0 (STO) 
Lj Status register 1 (ST1) 
[41 Processor mode status register (PMST) 


STO and ST1 contain the status of various conditions and modes; PMST con- 
tains memory-setup status and control information. Because these registers 
are memory-mapped, they can be stored into and loaded from data memory; 
the status of the processor can be saved and restored for subroutines and in- 
terrupt service routines (ISRs). 


4.1.1 Status Registers (STO and ST1) 


The individual bits of the STO and ST1 registers can be set or cleared with the 
SSBX and RSBxX instructions. For example, the sign-extension mode is set 
with SSBX 1, SXM, or reset with RSBX 1, SXM. The ARP, DP, and ASM bit 
fields can be loaded using the LD instruction with a short-immediate operand. 
The ASM and DP fields can be also loaded with data-memory values by using 
the LD instruction. 


The STO bits are shown in Figure 4—1 and described in Table 4-1. The ST1 
bits are shown in Figure 4—2 and described in Table 4—2 on page 4-4. 


Figure 4—1. Status Register 0 (STO) Diagram 


15-13 12 11 1 8-0 


0 9 
A 


Table 4—1. Status Register 0 (STO) Bit Summary 


Reset 
Bit Name Value Function 


15-13 ARP 0 Auxiliary register pointer. This 3-bit field selects the auxiliary register to use in the com- 
patibility mode of indirect single-operand addressing (see section 5.5, Indirect Ad- 
dressing, page 5-10). ARP must always be set to zero when the DSP is in standard 
mode (CMPT = 0). 
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Table 4-1. Status Register 0 (STO) Bit Summary (Continued) 


Bit 
12 


11 


10 


Name 
TC 


OVA 


OVB 


DP 


Reset 
Value 


{ 


Function 


Test/control flag. TC stores the results of the arithmetic logic unit (ALU) test bit op- 
erations. TC is affected by the BIT, BITF, BITT, CMPM, CMPR, CMPS, and SFTC 
instructions. The status (set or cleared) of TC determines if the conditional branch, 
call, execute, and return instructions execute. 


TC =1 if the following conditions are true: 


_] Abit tested by BIT or BITT isa 1. 


1 A compare condition tested by CMPM, CMPR, or CMPS exists between a data- 
memory value and an immediate operand, ARO and another auxiliary register, or an 
accumulator high word and an accumulator low word. 


«Bit 31 and bit 30 of an accumulator tested by SFTC have different values from 
each other. 


Carry is set to 1 if the result of an addition generates a carry; it is cleared to 0 if the 
result of a subtraction generates a borrow. Otherwise, it is reset after an addition and 
it is set after a subtraction, except for an ADD or SUB with a 16-bit shift. In these 
cases, the ADD can only set and the SUB only reset the carry bit, but they cannot 
affect it otherwise. Carry and borrow are defined at the 32nd bit position and are 
operated at the ALU level only. The shift and rotate instructions (ROR, ROL, SFTA, 
and SFTL), and the MIN, MAX, ABS, and NEG instructions also affect this bit. 


Overflow flag for accumulator A. OVA is set to 1 when an overflow occurs in either 
the ALU or the multiplier’s adder and the destination for the result is accumulator A. 
Once an overflow occurs, OVA remains set until either a reset, a BC[D], a CC[D], 
an RC[D], or an XC instruction is executed using the AOV and ANOV conditions. 
The RSBxX instruction can also clear this bit. 


Overflow flag for accumulator B. OVB is set to 1 when an overflow occurs in either 
the ALU or the multiplier’s adder and the destination for the result is accumulator B. 
Once an overflow occurs, OVB remains set until either a reset, a BC[D], a CC[D], 
an RC[D], or an XC instruction is executed using the BOV and BNOV conditions. 
This RSBX instruction can also clear this bit. 


Data-memory page pointer. This 9-bit field is concatenated with the seven LSBs of 
an instruction word to form a direct-memory address of 16 bits for single data- 
memory operand addressing. This operation is done if the compiler mode bit in ST1 
(CPL) = 0. The DP field can be loaded by the LD instruction with a short-immediate 
operand or from data memory. 
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Figure 4—2. Status Register 1 (ST1) Diagram 


14 13 12 11 
M 


15 10 9 8 7 6 5 4-0 


Table 4-2. Status Register 1 (ST1) Bit Summary 


Reset 
Bit Name Value Function 


18 BRAF 0 Block-repeat active flag. BRAF indicates whether a block repeat is currently active. 


BRAF = 0 The block repeat is deactivated. BRAF is cleared when the block- 
repeat counter (BRC) decrements below 0. 


BRAF = 1 The block repeat is active. BRAF is automatically set when an RPTB 
instruction is executed. 


14 CPL 0 Compiler mode. CPL indicates which pointer is used in relative direct addressing: 

CPL =0 The relative direct-addressing mode using the data page pointer 
(DP) is selected. 

CPL = 1 The relative direct-addressing mode using the stack pointer (SP) is 
selected. 

13 XF i XF status. XF indicates the status of the external flag (XF) pin, which is a general- 
purpose output pin. The SSBX instruction can set XF and the RSBX instruction can 
reset XF. 

12 HM 0 Hold mode. HM indicates whether the processor continues internal execution when 


acknowledging an active HOLD signal: 


HM = 0 The processor continues execution from internal program memory 
but places its external interface in the high-impedance state. 


HM = 1 The processor halts internal execution. 
11 INTM 1 Interrupt mode. INTM globally masks or enables all interrupts. 
INTM = 0 All unmasked interrupts are enabled. 


INTM = 1 All maskable interrupts are disabled. 


The SSBxX instruction sets INTM and the RSBX instruction resets INTM. INTM is set 
to 1 by reset or when a maskable interrupt trap is taken (INTR or external interrupts). 
INTM is cleared to 0 when a RETE or RETF instruction (return from interrupt) is 
executed. INTM does not affect the nonmaskable interrupts (RS and NMI). INTM 
cannot be set by memory-write operations. 


10 0 Always read as 0. 
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Table 4-2. Status Register 1 (ST1) Bit Summary (Continued) 


Bit 
g) 


6 


5 


4-0 


Name 
OVM 


SXM 


C16 


FRCT 


CMPT 


ASM 


Reset 
Value 


0 


0 


0 


0 


Function 


Overflow mode. OVM determines what is loaded into the destination accumulator 
when an overflow occurs: 


OVM =0 An overflowed result from either the ALU or the multiplier’s adder 
overflows normally in the destination accumulator. 


OVM = 1 The destination accumulator is set to either the most positive value 
(00 7FFF FFFFh) or the most negative value (FF 8000 0000h) upon 
encountering an overflow. 


The SSBX and RSBxX instructions set and reset OVM, respectively. 
Sign-extension mode. SXM determines whether sign extension is performed: 
SXM =0 Sign extension is suppressed. 

SXM = 1 Data is sign extended before being used by the ALU. 

SXM does not affect the definitions of certain instructions: the ADDS, LDU, and 
SUBS instructions suppress sign extension regardless of SXM value. The SSBX 


and RSBxX instructions set and reset SXM, respectively. 


Dual 16-Bit/double-precision arithmetic mode. C16 determines the arithmetic mode 
of the ALU’s operation: 


C16=0 The ALU operates in double-precision arithmetic mode. 
C16=1 The ALU operates in dual 16-bit arithmetic mode. 


Fractional mode. When FRCT is 1, the multiplier output is left-shifted by one bit to 
compensate for an extra sign bit. 


Compatibility mode. CMPT determines the compatibility mode for the ARP: 


CMPT =0 ARP is not updated in indirect addressing mode with a single data- 
memory operand. ARP must always be set to 0 when the DSP is in 
this mode. 


CMPT = 1 ARP is updated in indirect addressing mode with a single data- 
memory operand, except when the instruction is selecting auxiliary 
register 0 (ARO). 


Accumulator shift mode. The 5-bit ASM field specifies a shift value within a -16 
through 15 range and is coded as a 2s-complement value. Instructions with a 
parallel store, as wellas STH, STL, ADD, SUB, and LD, use this shift capability. ASM 
can be loaded from data memory or by the LD instruction using a short-immediate 
operand. 
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4.1.2 Processor Mode Status Register (PMST) 


The PMST register is loaded with memory-mapped register instructions such 
as STM. The PMST bits are shown in Figure 4—3 and described in Table 4-3. 


Figure 4—3. Processor Mode Status Register (PMST) Diagram 


15-7 6 5 4 3 2 1 0 
IPTR MP/MC | OVLY | AVIS |DROM | CLKOFF |SMULTt | SSTT 


t Refer to the device-specific datasheet to determine if these bits are supported. 


Table 4-3. Processor Mode Status Register (PMST) Bit Summary 


Reset 
Bit Name Value Function 
15-7 IPTR 1FFh Interrupt vector pointer. The 9-bit IPTR field points to the 128-word program 


page where the interrupt vectors reside. You can remap the interrupt vectors to 
RAM for boot-loaded operations. At reset, these bits are all set to 1; the reset 
vector always resides at address FF80h in program-memory space. The 
RESET instruction does not affect this field. 


6 MP/MC MP/MG Microprocessor/microcomputer mode. MP/MC enables/disables the on-chip 
pin ROM to be addressable in program memory space. 


MP/MC =0 — The on-chip ROM is enabled and addressable. 
MP/MC = 1 The on-chip ROM is not available. 


MP/MC is set to the value corresponding to the logic level on the MP/MC pin 
when sampled at reset. This pin is not sampled again until the next reset. The 
RESET instruction does not affect this bit. This bit can also be set or cleared by 
software. 


5 OVLY 0 RAM overlay. OVLY enables on-chip dual-access data RAM blocks to be 
mapped into program space. The values for the OVLY bit are: 


OVLY =0 The on-chip RAM is addressable in data space but not in 
program space. 


OVLY = 1 The on-chip RAM is mapped into program space and data space. 
Data page 0 (addresses Oh to 7Fh), however, is not mapped into 
program space. 


t Refer to the device-specific datasheet to determine if these bits are supported. 
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Table 4—3. Processor Mode Status Register (PMST) Bit Summary (Continued) 


Reset 
Bit Name Value Function 
4 AVIS 0 Address visibility mode. AVIS enables/disables the internal program address to 
be visible at the address pins. 

AVIS = 0 The external address lines do not change with the internal 
program address. Control and data lines are not affected and 
the address bus is driven with the last address on the bus. 

AVIS = 1 This mode allows the internal program address to appear at the 
pins of the ’54x so that the internal program address can be 
traced. Also, it allows the interrupt vector to be decoded in 
conjunction with IACK when the interrupt vectors reside in 
on-chip memory. 

3 DROM 0 Data ROM. DROM enables on-chip ROM to be mapped into data space. The 


values for the DROM bit are: 
DROM = 0 The on-chip ROM is not mapped into data space. 


DROM = 1 A portion of the on-chip ROM is mapped into data space. See 
Chapter 3, Memory, for details. 


2 CLKOFF 0 CLOCKOUT off. When the CLKOFF bit is 1, the output of CLKOUT is disabled 
and remains at a high level. 


1 SMULt N/A Saturation on multiplication. When SMUL = 1, saturation of a multiplication re- 
sult occurs before performing the accumulation in a MAC or MAS instruction. 
The SMUL bit applies only when OVM = 1 and FRCT = 1. 


SMUL bit allows the MAC and MAS operations to be consistent with MAC and 
MAS basic operation defined in ETSI GSM specifications (GSM specs 6.06, 
6.10, 6.53). The effect is that the result of 8000h x 8000h is saturated to 
7FF FFFFh in fractional mode, before performing subsequent addition/subtrac- 
tion required by a MAC or MAS instruction. In this mode, the MAC instruction 
is equivalent to MPY + ADD when OVM=1. If the mode is not set and OVM = 1, 
the result of the multiplication is not saturated before performing the addition/ 
subtraction, only the results of the MAC and MAS instructions are saturated. 


See Example 4—1 on page 4-9 for examples of saturation on multiplication 
operations. 


T Refer to the device-specific datasheet to determine if these bits are supported. 
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Table 4—3. Processor Mode Status Register (PMST) Bit Summary (Continued) 


Bit 
0 


Name 
ssTt 


Reset 
Value 


N/A 


Function 


Saturation on store. When SST 1, saturation of the data from the accumulator 
is enabled before storing in memory. The saturation is performed after the shift 
operation. Saturation on store takes place with the following instructions: STH, 
STL, STLM, DST, ST||ADD, ST||/LD, ST||MACR[R], ST||MAS[R], ST||MPY, and 
ST||SUB. The following steps are performed when using saturate on store: 


1) A 40-bit data value is shifted (right or left) depending on the instruction. The 
shift is the same as described in the SFTA instruction and depends on the 
SXM bit. 


2) The 40-bit data value is saturated to a 32-bit value; the saturation depends 
on the SXM bit (the number is always assumed to be positive). 


If SXM = 0, the following 32-bit value is generated: 
M@ /7FFF FFFFh if the value is greater than 7FFF FFFFh 
If SXM = 1, the following 32-bit value is generated: 
M@ /7FFF FFFFh if the value is greater than 7FFF FFFFh 
HM 8000 0000h if the value is less than 8000 0000h 
3) The data is stored in memory depending upon instruction. 
4) The accumulator contents remain unchanged during the operation. 


See Example 4—2 on page 4-10 for examples of saturation on store operations. 


t Refer to the device-specific datasheet to determine if these bits are supported. 


Example 4—1. Use of SMUL Bit 


MAC *AR1+, A 


A 
T 
AR1 
Data Memory 
100h 


MAC *AR1+, A 


A 
T 
AR1 
Data Memory 
100h 
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;SMUL=1, FRCT=1, OVM=1 


Before Instruction 


FF FFFF FFFFh 
8000h 
100h 


8000h 


, SXM=1 


Data Memory 


;SMUL=0, FRCT=1, OVM=1 


Before Instruction 


FF FFFFFFFFh 
8000h 
10th 


8000h 


Data Memory 


After Instruction 


A 00 7FFF FFFEh 
T 8000h 
AR1 101h 
100h 8000h 
, SXM=1 
After Instruction 
A 00 7FFFFFFFh 
T 8000h 
AR1 102h 
100h 8000h 
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Example 4-2. Use of SST Bit 


STHA, -4, *AR1+ ;SXM=1, SST=1 
Before Instruction 
A 7FFFFFO000h 
AR1 100h 

Data Memory 
100h 5555h 
STLA, -4, *AR1+ ;SXM=1, SST=1 
Before Instruction 
A 7FFFFFO000h 
AR1 101h 

Data Memory 
101h OAAAAh 

DSTB, *AR3 ;SXM=0, SST=1 

Before Instruction 
B 8FFFFFO000h 
AR3 103h 

Data Memory 
102h 1234h 
103h 5678h 
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A 
AR1 
Data Memory 
100h 


A 
AR1 
Data Memory 
101h 


B 
AR3 
Data Memory 
102h 
103h 


After Instruction 
7FFFFFOOO0Oh 


101h 


7FFFh 


After Instruction 
7FFFFFOOO0Oh 


102h 


OFFFFh 


8FFFFFO000h 
10%h 


OFFFFh 
OFFFFh 
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4.2 Arithmetic Logic Unit (ALU) 


The 40-bit ALU, shown in Figure 4—4, implements a wide range of arithmetic 
and logical functions, most of which execute in a single clock cycle. After an 
operation is performed in the ALU, the result is usually transferred to a destina- 
tion accumulator (accumulator A or B). Instructions that perform memory-to- 
memory operations (ADDM, ANDM, ORM, and XORM) are exceptions. 


Figure 4—4. ALU Functional Diagram 


T DB15—DBO 


D Ss Shifter output (40) 


40 740 


SXM SXM 


OVM 
C16 


A A C 


f-——_» OVA/OVB 
ZA/ZB 


40 TC 


40 A Legend: 
Accumulator A 
40 Accumulator B 
CB data bus 
DB data bus 
MAC unit 
Barrel shifter 
T register 
ALU 


MAC 
output 


CHAM=EZ0VUOWDD 


4.2.1 ALU Input 
ALU input takes several forms from several sources. 
The X input source to the ALU is either of two values: 


(1 The shifter output (a 32-bit or 16-bit data-memory operand or a shifted 
accumulator value) 
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Table 4-4. ALU Input 
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Lj Adata-memory operand from data bus DB 
The Y input source to the ALU is any of three values: 
Lj) The value in one of the accumulators (A or B) 


1 A data-memory operand from data bus CB 


_j The value in the T register 


When a 16-bit data-memory operand is fed through data bus CB or DB, the 
40-bit ALU input is constructed in one of two ways: 


Lj If bits 15 through 0 contain the data-memory operand, bits 39 through 16 
are zero filled (SXM = 0) or sign-extended (SXM = 1). 


Lj If bits 31 through 16 contain the data-memory operand, bits 15 through 0 
are zero filled, and bits 39 through 32 are either zero filled (SXM = 0) or 
sign extended (SXM = 1). 


Table 4—4 shows how the ALU inputs are obtained for the ADD instructions, 
depending on the type of syntax used. The ADD instructions execute in one 
cycle, except for cases 4, 7, and 8 that use two words and execute in two 
cycles. 


Selection for ADD Instructions 


Case _ Instruction Syntax Words A B DB CB _ Shift 
1 ADD *AR1,A 1 V V 
2 ADD *AR3,TS,A 1 V 
3 ADD *AR2,16,B,A 1 V 
4 ADD *AR1,8,B,A 2 V 
5 ADD *AR2,8,B 1 V 
6 ADD *AR2, *AR3,A 1 V V 
7 ADD #1234h,6,A,B 2 V 
8 ADD ~ #1234h, 16, A,B 2 V 
9 ADD A,12,B 1 V V 
10 ADD’ B,ASM,A 1 V 
11. DADD *AR2, A,B 1 V V 
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4.2.2 Overflow Handling 


The ALU saturation logic prevents a result from overflowing by keeping the 
result ata maximum (or minimum) value. This feature is useful for filter calcula- 
tions. The logic is enabled when the overflow mode bit (OVM) in status register 
ST1 is set. 


When a result overflows: 


Lj) If OVM = 0, the accumulators are loaded with the ALU result without 
modification. 


Lj) If OVM = 1, the accumulators are loaded with either the most positive 
32-bit value (00 7FFF FFFFh) or the most negative 32-bit value 
(OFF 8000 0000h), depending on the direction of the overflow. 


Lj The overflow flag (OVA/OVB) in status register STO is set for the destina- 
tion accumulator and remains set until one of the following occurs: 


m A reset is performed. 


m@ A conditional instruction (such as a branch, a return, a call, or an 
execute) is executed on an overflow condition. 


m The overflow flag (OVA/OVB) is cleared. 


ES 
Note: 


You can saturate the accumulator by using the SAT instruction, regardless 
of the value of OVM. 


eee sss) 


4.2.3. The Carry Bit 


The ALU has an associated carry bit (C) thatis affected by most arithmetic ALU 
instructions, including rotate and shift operations. The carry bit supports effi- 
cient computation of extended-precision arithmetic operations. The carry bit 
is not affected by loading the accumulator, performing logical operations, or 
executing other nonarithmetic or control instructions, so it can be used for 
overflow management. 


Two conditional operands, C and NC, enable branching, calling, returning, and 
conditionally executing according to the status (set or cleared) of the carry bit. 
Also, the RSBX and SSBxX instructions can be used to load the carry bit. The 
carry bit is set on a hardware reset. 
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4.2.4 Dual 16-Bit Mode 


4-14 


For arithmetic operations, the ALU can operate in a special dual 16-bit arith- 
metic mode that performs two 16-bit operations (for instance, two additions or 
two subtractions) in one cycle. You can select this mode by setting the C16 field 
of ST1. This mode is especially useful for the Viterbi add/compare/select 
operation (see section 4.6, Compare, Select, and Store Unit (CSSU), on page 
4-26). 
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4.3 Accumulators A and B 


Accumulator A and accumulator B can be configured as the destination regis- 
ters for either the multiplier/adder unit or the ALU. In addition, they are used 
for MIN and MAX instructions or for the parallel instruction LD||MAC, in which 
one accumulator loads data and the other performs computations. 


Each accumulator is split into three parts, as shown in Figure 4—5 and 
Figure 4-6. 


Figure 4—5. Accumulator A 


39-32 31-16 15-0 
Guard bits High-order bits Low-order bits 
Figure 4—6. Accumulator B 
39-32 31-16 15-0 
Guard bits High-order bits Low-order bits 


The guard bits are used as a headmargin for computations. Headmargins 
allow you to prevent some overflow in iterative computations such as 
autocorrelation. 


AG, BG, AH, BH, AL, and BL are memory-mapped registers that can be 
pushed onto and popped from the stack for context saves and restores by 
using PSHM and POPM instructions. These registers can also be used by 
other instructions that use memory-mapped registers (MMR) for page 0 
addressing. The only difference between accumulators A and B is that bits 
32-16 of Acan be used as an input to the multiplier in the multiplier/adder unit. 


4.3.1. Storing Accumulator Contents 


You can store accumulator contents in data memory by using the STH, STL, 
STLM, and SACCD instructions or by using parallel-store instructions. To store 
the 16 MSBs of the accumulator in memory with a shift, use the STH, SACCD, 
and parallel-store instructions. For right-shift operations, bits from AG and BG 
shift into AH and BH. For left-shift operations, bits from AL and BL shift into AH 
and BH, respectively. 


To store the 16 LSBs of the accumulator in memory with a shift, use the STL 
instruction. For right-shift operations, bits from AH and BH shift into AL and BL, 
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respectively, and the LSBs are lost. For left-shift operations, the bits in AL and 
BL are filled with zeros. Since shift operations are performed in the shifter, the 
contents of the accumulator remain unchanged. Example 4—3 shows the 
result of accumulator store operations with shift; it assumes that accumulator 
A= OFF 4321 1234h. 


Example 4-3. Accumulator Store With Shift 


STH A,8,TEMP ; TEMP = 2112h 
STH A,-8,TEMP ; TEMP = FF43h 
STL A,8,TEMP ; TEMP = 3400h 
STL A,-8,TEMP j; TEMP = 2112h 


4.3.2 Accumulator Shift and Rotate Operations 
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The following instructions shift or rotate the contents of the accumulator 
through the carry bit: 


.) SFTA (shift arithmetically) 

SFTL (shift logically) 

SFTC (shift conditionally) 

ROL (rotate accumulator left) 

ROR (rotate accumulator right) 

[1 ROLTC (rotate accumulator left with TC) 


Lj 
L) 
L] 
L] 


In SFTA and SFTL, the shift count is defined as—16 < SHIFT < 15. SFTAis 
affected by the SXM bit. When SXM = 1 and SHIFT is a negative value, SFTA 
performs an arithmetic right shift and maintains the sign of the accumulator. 
When SXM = 0, the MSBs of the accumulator are zero filled. SFTL is not 
affected by the SXM bit; it performs the shift operation for bits 31-0, shifting 
Os into the MSBs or LSBs, depending on the direction of the shift. 


SFTC performs a 1-bit left shift when both bits 31 and 30 are 1 or both are 0. 
This normalizes 32 bits of the accumulator by eliminating the most significant 
nonsign bit. 


ROL rotates each bit of the accumulator to the left by one bit, shifts the value 
of the carry bit into the LSB of the accumulator, shifts the value of the MSB of 
the accumulator into the carry bit, and clears the accumulator’s guard bits. 


ROR rotates each bit of the accumulator to the right by one bit, shifts the value 
of the carry bit into the MSB of the accumulator, shifts the value of the LSB of 
the accumulator into the carry bit, and clears the accumulator’s guard bits. 


The ROLTC instruction (rotate accumulator left with TC) rotates the accumula- 
tor to the left and shifts the test control (TC) bit into the LSB of the accumulator. 
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4.3.3. Saturation Upon Accumulator Store 


The SST bit in PMST determines whether or not the data in an accumulator 
is saturated before storing it in memory. The saturation is performed after the 
shift operation. Saturation on store is available with ten instructions: 


L} 
L} 
L} 
Lj 


STH  ST|| ADD  ST|| MPY 
STL Q STI|LD  ST|| SUB 
STLM ST || MACTR] 
DST ST || MAS[R] 


The following steps are performed when saturating upon accumulator store: 


1) 


3) 


The 40-bit data value is shifted (right or left) depending on the instruction. 
The shift is the same as described in the SFTA instruction and depends 
on the value of the SXM bit. 


The 40-bit value is saturated to a 32-bit value. The saturation depends on 
the value of the SXM bit (the number is always assumed to be positive): 


m SXM=0.7FFF FFFFh is generated if the 40-bit value is greater than 
or equal to 7FFF FFFFh. 


mM SXM=1.7FFF FFFFh is generated if the 40-bit value is greater than 
7FFF FFFFh. 8000 0000h is generated if the 40-bit value is less than 
8000 0000h. 


The data is stored in memory depending on the instruction (either 16-bit 
LSB, 16-bit MSB, or 32-bit data). 


The accumulator remains unchanged during this process. 


4.3.4 Application-Specific Instructions 


Each accumulator is dedicated to specific operations in application-specific 
instructions with parallel operations. These include symmetrical FIR filter 
operations using the FIRS instruction, adaptive filter operations using the LMS 
instruction, Euclidean distance calculations using the SQDST instruction, and 
other parallel operations: 


L) 


Ly 


FIRS performs operations for symmetric FIR filters by using multiply/ 
accumulates (MACs) in parallel with additions. 


LMS performs a MAC and aparallel add with rounding to efficiently update 
the coefficients in an FIR filter. 


SQDST performs a MAC and a subtract in parallel to calculate Euclidean 
distance. 
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FIRS multiplies accumulator A(82-16) with a program-memory value 
addressed by a program-memory address and adds the result to the value in 
accumulator B. At the same time, it adds the memory operands Xmem and 
Ymem, shifts the result left 16 bits, and loads this value into accumulator A. 


In the LMS instruction, accumulator B stores the interim results of the input 
sequence convolution and filter coefficients; accumulator A updates the filter 
coefficients. Accumulator A can also be used as an input for MAC, which con- 
tributes to single-cycle execution of instructions with parallel operations. 


The SQDST instruction computes the square of the distance between two 
vectors. Accumulator A(32—16) is squared and the product is added to accu- 
mulator B. The result is stored in accumulator B. At the same time, Ymem is 
subtracted from Xmem and the difference is stored in accumulator A. The val- 
ue that is squared is the value of the accumulator before the subtraction, 
Ymem — Xmem, is executed. 


Barrel Shifter 


4.4 Barrel Shifter 
The barrel shifter is used for scaling operations such as: 


Lj Prescaling an input data-memory operand or the accumulator value 
before an ALU operation 


_j Performing a logical or arithmetic shift of the accumulator value 


J Normalizing the accumulator 


1 Postscaling the accumulator before storing the accumulator value into 
data memory 


The 40-bit shifter (see Figure 4—7 on page 4-20) is connected as follows: 


Lj The input is connected to: 


m@ DB for a 16-bit data input operand 
m@ DB and CB for a 32-bit data input operand 
m Either one of the two 40-bit accumulators 


Lj The output is connected to: 


Mm One of the ALU inputs 
m The EB bus through the MSW/LSW write select unit 


The SXM bit controls signed/unsigned extension of the data operands; when 
the bit is set, sign extension is performed. Some instructions, such as LDU, 
ADDS, and SUBS operate with unsigned memory operands and do not per- 
form sign extension, regardless of the SXM value. 


The shift count determines how many bits to shift. Positive shift values corre- 
spond to left shifts, whereas negative values correspond to right shifts. The 
shift count is specified as a 2s-complement value in several ways, depending 
on the instruction type. An immediate operand, the accumulator shift mode 
(ASM) field of ST1, or T can be used to define the shift count: 


Li A4or 5-bit immediate value specified in the operand of an instruction 
represents a shift count value in the —16 to 15 range. For example: 
ADD A,-4,B ; Add accumulator A (right-shifted 


; 4 bits) to accumulator B 
, (one word, one cycle). 


SFTL A,+8 ; Shift (logical) accumulator A eight 
; bits left (one word, one cycle) 


(1 The ASM value represents a shift count value in the —16 to 15 range and 
can be loaded by the LD instruction (with an immediate operand or with 
a data-memory operand). For example: 


ADD A, ASM, B , Add accumulator A to accumulator B 
; with a shift specified by ASM 
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(1 The six LSBs of T represent a shift count value in the —16 to 31 range. For 
example: 


NORM A ; Normalize accumulator A (T 
; contains the exponent value) 


Figure 4—7. Barrel Shifter Functional Diagram 
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4.5 Multiplier/Adder Unit 


The ’54x CPU has a 17-bit x 17-bit hardware multiplier coupled to a 40-bit dedi- 
cated adder. This multiplier/adder unit provides multiply and accumulate 
(MAC) capability in one pipeline phase cycle. The multiplier/adder unit is 
shown in Figure 4-8 on page 4-22. 


The multiplier can perform signed, unsigned, and signed/unsigned multiplica- 
tion with the following constraints: 


Lj For signed multiplication, each 16-bit memory operand is assumed to be 
a 17-bit word with sign extension. 


1 For unsigned multiplication, a 0 is added to the MSB (bit 16) in each input 
operand. 


1 For signed/unsigned multiplication, one of the operands is sign extended, 
and the other is extended with a 0 in the MSB (zero filled). 


The multiplier output can be shifted left by one bit to compensate for the extra 
sign bit generated by multiplying two 16-bit 2s-complement numbers in frac- 
tional mode. (Fractional mode is selected when the FRCT bit = 1 in ST1.) 


The adder in the multiplier/adder unit contains a zero detector, a rounder (2s 
complement), and overflow/saturation logic. Rounding consists of adding 215 
to the result and then clearing the lower 16 bits of the destination accumulator. 
Rounding is performed in some multiply, MAC, and multiply/subtract (MAS) 
instructions when the suffix R is included with the instruction. The LMS instruc- 
tion also rounds to minimize quantization errors in updated coefficients. 


The adder’s inputs come from the multiplier’s output and from one of the accu- 
mulators. Once any multiply operation is performed in the unit, the result is 
transferred to a destination accumulator (A or B). 
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Figure 4—8. Multiplier/Adder Functional Diagram 
CB15—CBO 40 
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4.5.1 Multiplier Input Sources 


This section lists sources for multiplier inouts and discusses how multiplier in- 
puts can be selected for various instructions. 


The XM input source to the multiplier is any of the following values: 


1 The temporary register (T) 
1 Adata-memory operand from data bus DB 
[J Accumulator A bits 32 — 16 
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The YM input source to the multiplier is any of the following values: 


1 Adata-memory operand from data bus DB 

1 Adata-memory operand from data bus CB 

_j Aprogram-memory operand from program bus PB 
Lj Accumulator A bits 32 — 16 


Table 4—5 shows how the multiplier inputs are obtained for several instructions. 
There are a total of nine combinations of multiplier inputs that are actually used. 


For instructions using T as one input, the second input may be obtained as an 
immediate value or from data memory via a data bus (DB), or from accumula- 
tor A. 


For instructions using single data-memory operand addressing, one operand 
is fed into the multiplier via DB. The second operand may come from T, as an 
immediate value or from program memory via PB, or from accumulator A. 


For instructions using dual data-memory operand addressing, DB and CB 
carry the data into the multiplier. 


The last two cases are used with the FIRS instruction and the SQUR and 
SQDST instructions. The FIRS instruction obtains inputs from PB and accu- 
mulator A. The SQUR and SQDST obtain both inputs from accumulator A. 


Table 4—5. Multiplier Input Selection for Several Instructions 


X Multiplexer Y Multiplexer 

Case Instruction Type "T DB A. “PB CB DB A. 

1 MPY  #1234h, A V V 

2 MPY[R] *AR2, A V V 

3 MPYA B V 

4 MACP *AR2, pmad, A V V 

5 MPY *AR2, *AR3, B V V 

6 SQUR *AR2,B V V 

7 MPYA *AR2 V 

8 FIRS  *AR2, *AR3, pmad V V 

9 SQUR A,B V V 
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T provides one operand for multiply and multiply/accumulate instructions; the 
other memory operand is a single data-memory operand. T also provides an 
operand for multiply instructions with parallel load or parallel store, such as 
LD||MAC, LD||MAS, ST||MAC, ST||MAS, and ST||MPY. T can be loaded 
explicitly by instructions that support a memory-mapped register addressing 
mode or implicitly during multiply operations. 


Since bits A(82—16) can be an input to the multiplier, some sequences that 
require storing the result of one computation in memory and feeding this result 
to the multiplier can be made faster. For some application-specific instructions 
(FIRS, SQDST, ABDST, and POLY), the contents of accumulator A can be 
computed by the ALU and then input to the multiplier without any overhead. 


4.5.2 Multiply/Accumulate (MAC) Instructions 
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MAC instructions use the multiplier’s computational bandwidth to simulta- 
neously process two operands. Multiple arithmetic operations can be 
performed in a single cycle by the multiplier/adder unit. 


In the MAC, MAS, and MACSU instructions with dual data-memory operand 
addressing, data can be transferred to the multiplier during each cycle via CB 
and DB and multiplied and added in a single cycle. Data addresses for these 
operands are generated by ARAUO and ARAU1, the auxiliary register arithme- 
tic units. For information about ARAUO and ARAU1, see section 5.5.2, ARAU 
and Address-Generation Operation, on page 5-11. 


In the MACD and MACP instructions, data can be transferred to the multiplier 
during each cycle via DB and PB. DB retrieves data from data memory, and 
PB retrieves coefficients from program memory. When MACD and MACP are 
used with repeat instructions (RPT and RPTZ), they perform single-cycle MAC 
operations with sequential access of data and coefficients. Data addresses 
are generated by ARAUO and the program address register (PAR). The data- 
memory address is updated by ARAUO according to a single data-memory 
operand in the indirect addressing mode; the program-memory address is 
incremented by PAGEN. 


The repeated MACD instruction supports filtering constructs (weighted 
running average). While the sum-of-products is executed, the sample data is 
shifted in memory to make room for the next sample and to throw away the 
oldest sample. MAC and MACP instructions with circular addressing can also 
support filter implementation. The FIRS instruction implements an efficient 
symmetric structure for the FIR filter when circular addressing is used. 


Multiplier/Adder Unit 


The MPYU and MACSU instructions facilitate extended-precision arithmetic 
operations. The MPYU instruction performs an unsigned multiplication. The 
unsigned contents of T are multiplied by the unsigned contents of the ad- 
dressed data-memory location, and the result is placed in the specified accu- 
mulator. The MACSU instruction performs a signed/unsigned multiplication 
and addition. The unsigned contents of one data-memory location are multi- 
plied by the signed contents of another data-memory location, and the result 
is added to the accumulator. This operation allows operands greater than 16 
bits to be broken down into 16-bit words and then processed separately to gen- 
erate products that are larger than 32 bits. 


The square/add (SQURA) and square/subtract (SQURS) instructions pass 
the same data value to both inputs of the multiplier to square the value. The 
result is added to (SQURA) or subtracted from (SQURS) the accumulator at 
the adder level. The SQUR instruction squares a data-memory value or the 
contents of accumulator A. 


4.5.3. MAC and MAS Saturation Upon Multiplication 


When saturate-on-multiply is set (SMUL = 1), the MAC instruction is equivalent 
to MPY + ADD when OVM = 1. The effect is that the multiplication, 
8000h x 8000h, is saturated to 7FFF FFFFh in fractional mode before per- 
forming the subsequent addition (MAC) or subtraction (MAS). 


When saturate-on-multiply is not set (SMUL = 0), only the end results of MAC 
and MAS are saturated. 


When OVM = 1 and FRCT = 1, the SMUL bit in PMST determines whether or 
not the result of a multiplication is saturated before the accumulation is per- 
formed in MAC and MAS instructions. This feature allows the MAC and MAS 
operations to be consistent with the MAC and MAS basic operation defined in 
ETSI GSM specifications (GSM specifications 6.06, 6.10, and 6.53). 
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4.6 Compare, Select, and Store Unit (CSSU) 


The compare, select, and store unit (CSSU) is an application-specific hard- 
ware unit dedicated to add/compare/select (ACS) operations of the Viterbi 
operator. Figure 4—9 shows the CSSU, which is used with the ALU to perform 
fast ACS operations. 


Figure 4—9. Compare, Select, and Store Unit (CSSU) 
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The CSSU allows the ’54x to support various Viterbi butterfly algorithms used 
in equalizers and channel decoders. 


The add function of the Viterbi operator (see Figure 4—10) is performed by the 
ALU. This function consists of a double addition function (Meti + D1 and 
Met2 + D2). Double addition is completed in one machine cycle if the ALU is 
configured for dual 16-bit mode by setting the C16 bit in ST1. With the ALU 
configured in dual 16-bit mode, all the long-word (32-bit) instructions become 
dual 16-bit arithmetic instructions. 


T is connected to the ALU input (as a dual 16-bit operand) and is used as local 
storage in order to minimize memory access. Table 4—6 shows the instructions 
that perform dual 16-bit ALU operations. 


Compare, Select, and Store Unit (CSSU) 


Figure 4—10. Viterbi Operator 


Old state New state 
D1 J 
(New_Met1) 
2J 
Ment) D2 If (Met1 + D1) > (Met2 + D2) 
then New_Met1 = Meti + D1 
else 
New_Met1 = Met2 + D2 
23 +1 
(Met2) 
J + STNB/2 
(New_Met2) 
(Old metrics) (New metrics) 
Legend: STNB Number of states 
Met Path metrics 
D Branch metrics 
Table 4-6. ALU Operations in Dual 16-Bit Mode 
Instruction Function (Dual 16-Bit Mode) 
DADD_ Lmen, src [, dst] src(31—16) + Lmem(31—16) — dst(39-16) 
src(15—0) + Lmem(15-—0) — dst(15—0) 
DADST Lmem, dst Lmem(31-—16) + T > dst(39—16) 
Lmem(15—0) — T — dst(15—0) 
DRSUB Lmen, src Lmem(31—16) — src(31—16) — src(39-16) 
Lmem(15—0) — src(15—0) — src(15—0) 
DSADT Lmem, dst Lmem(31-16) — T > dst(39-16) 
Lmem(15—0) + T — dst(15—0) 
DSUB- _Lmen, src src(31—16) — Lmem(31-16) — src(39-16) 
src(15—0) — Lmem(15—0) — src(15—0) 
DSUBT Lmem, dst Lmem(31-16) — T > dst(39-16) 
Lmem(15—0) — T — dst(15—0) 
Legend: > Is stored to 
Lmem _ Long (32-bit) data-memory value 
src Source accumulator (A or B) 
dst Destination accumulator (A or B) 


x(n—m) Read as bits n through m of x 
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The CSSU implements the compare and select operation via the CMPS 
instruction, a comparator, and the 16-bit transition register (TRN). This opera- 
tion compares two 16-bit parts of the specified accumulator and shifts the 
decision into bit 0 of TRN. This decision is also stored in the TC bit of STO. 
Based on the decision, the corresponding 16-bit part of the accumulator is 
stored in data memory. Example 4—4 shows the compare and select operation 
executed by the CMPS instruction. 


Example 4—4. CMPS Instruction Operation 


CMPS  B, *AR3 ;if (B(31-16)>B(15-0)) then 
;B(31-16) ->(*AR3); TRN<<1; O->TRN(O); 
;O0->TC} 
;else B(15-0)->(*AR3); TRN<<1; 
,1->TRN(0O); 1->TC; 


TRN contains information of the path transition decisions to new states. This 
information can be used for a back-tracking routine that finds the optimal path, 
which results in decoding the code. 
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4.7 Exponent Encoder 


The exponent encoder is an application-specific hardware device dedicated 
to supporting the EXP instruction in a single cycle (see Figure 4-11). With the 
EXP instruction, the exponent value in the accumulator can be stored in T as 
a 2s-complement value within a—8 through 31 range. The exponent is defined 
as the number of leading redundant bits — 8, which corresponds to the number 
of shifts required in the accumulator to eliminate nonsignificant sign bits. This 
operation results in a negative value when the accumulator value exceeds 
32 bits. 


Figure 4—11.Exponent Encoder 


From accumulator A 
From accumulator B 


EXP encoder 


To T register 


The EXP and NORM instructions use the exponent encoder to normalize the 
accumulator’s contents efficiently. NORM supports shifting the accumulator 
value by the number of bits specified in T in a single cycle. A negative value 
in T produces a right shift of the accumulator’s contents, which normalizes any 
value beyond the 32-bit range of the accumulator. Example 4-5 demonstrates 
the normalization of accumulator A. 


Example 4—5. Normalization of Accumulator A 


;Normalize accumulator A 


EXP A ; (the number of leading bits - 8)-> T. 
ST T, EXPONENT ; Store the exponent (T) into data 

; memory 
NORM A ; Normalize accumulator A, (A) <<(T) 
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Data Addressing 


The ’54x offers seven basic addressing modes: 
1 /mmediate addressing uses the instruction to encode a fixed value. 
_j Absolute addressing uses the instruction to encode a fixed address. 


._j Accumulator addressing uses an accumulator to access a location in pro- 
gram memory as data. 


_j Direct addressing uses seven bits of the instruction to encode an offset rel- 
ative to DP orto SP. The offset plus DP or SP determine the actual address 
in data memory. 


Lj Indirect addressing uses the auxiliary registers to access memory. 


Lj Memory-mapped register addressing modifies the memory-mapped reg- 
isters without affecting either the current DP value or the current SP value. 


1 Stack addressing manages adding and removing items from the system 
stack. 
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5.1 Immediate Addressing 


In immediate addressing, the instruction syntax contains the specific value of 
the operand. Two types of values can be encoded in an instruction: 


.) Short immediate values can be 3, 5, 8, or 9 bits in length. 
_j 16-bit immediate values are always 16 bits in length. 


Immediate values can be encoded in 1-word or 2-word instructions. The 3-, 5-, 
8-, or 9-bit values are encoded into 1-word instructions; 16-bit values are en- 
coded into 2-word instructions. 


The length of the immediate value encoded in an instruction depends on the 
type of instruction used. Table 5—1 lists the ’54x instructions that can encode 
immediate values in their instruction word(s). The table also gives the bit value 
that can be encoded in the instruction. 


Table 5—1. Instructions That Allow Immediate Addressing 


5-2 


3- and 5-Bit 8-Bit 9-Bit 
Constants Constant Constant 16-Bit Constant 
LD FRAME LD ADD ORM 
LD ADDM RPT 
RPT AND RPTZ 
ANDM Sill 
BITF STM 
CMPM SUB 
LD XOR 
MAC XORM 
OR 


The syntax for immediate addressing uses a number sign (#) immediately pre- 
ceding the value or symbol to indicate that it is an immediate value. For exam- 
ple, to load accumulator A with the value 80 in hexadecimal, you would write: 


LD #80h, A 


Figure 5—1 and Figure 5-2 use the RPT instruction to show how an immediate 
value is encoded in instructions that use immediate addressing. The opcode 
in the instruction is encoded in the high half of the instruction: bits 8-15 of a 
1-word encoding; bits 0-15 of the high word of a 2-word encoding. The value 
of the constant is in the rest of the space. 


Immediate Addressing 


Figure 5—1. RPT Instruction With Short-lmmediate Addressing 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 #1 0 


1 instruction word 1 4 14 0 4 4 +0 ~0 8-bit constant 


Figure 5-2. RPT Instruction With 16-Bit-lmmediate Addressing 


15 14 13 12 11 10 9 8 YY 6 5 4 3 2 1 0 


16-bit constant 


2 instruction words 
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5.2 Absolute Addressing 
There are four types of absolute addressing: 


Lj] dmad addressing: 


m MVDK Smem, dmad 
m@ MVDM dmad, MMR 
m@ MVKD dmad, Smem 
m@ MVMD MMR, dmad 


(1 pmad addressing: 


FIRS Xmem, Ymem, pmad 
MACD Smem, pmad, src 
MACP Smem, pmad, src 
MVDP Smem, pmad 
MVPD pmad, Smem 


(1 PA addressing: 


m PORTR PA, Smem 
m PORTW Smem, PA 


(] *(/k) addressing is used with all instructions that support the use of a single 
data-memory (Smem) operand. 


Absolute addresses are always encoded with a length of 16 bits, so instruc- 
tions that encode absolute addresses are always at least two words in length. 


5.2.1. dmad Addressing 


Data-memory address (dmaa) addressing uses a specific value to specify an 
address in data space. 


The syntax for dmad addressing uses a symbol or a number to specify an 
address in data space. For example, to copy the value contained at the ad- 
dress labeled SAMPLE in data space to the memory location in data space 
pointed to by AR5, you would write: 


MVKD SAMPLE, *AR5 


In this example, the address referenced by SAMPLE is the dmad value. 
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5.2.2 pmad Addressing 


Program-memory address (pmad) addressing uses a specific value to specify 
an address in program space. 


The syntax for pmad addressing uses a symbol or a number to specify an 
address in program space. For example, to copy a word in the program- 
memory location labeled TABLE to a data-memory location specified by AR7, 
you would write: 


MVPD TABLE, *AR7- 


In this example, the address referenced by TABLE is the pmad value. 


5.2.3 PA Addressing 


Port address (PA) addressing uses a specific value to specify an external I/O 
port address. 


The syntax for PA addressing uses a symbol or a number to specify the port 
address. For example, to copy a value from the I/O port at port address FIFO 
to a data-memory location pointed to by AR5, you would write: 


PORTR FIFO, *AR5 


In the example, FIFO refers to the port address. 


5.2.4 *(lk) Addressing 
*(/k) addressing uses a specific value to specify an address in data space. 


The syntax for *(Ik) addressing uses a symbol or a number to specify an 
address in data space. For example, to load accumulator A with the value 
contained in address BUFFER in data space, you would write: 


LD * (BUFFER) ,A 


The syntax for *(Ik) addressing allows all instructions that use Smem addres- 
sing to access any location in data space without changing the DP or initializing 
an AR. When this form of absolute addressing is used, the length of the instruc- 
tion is extended by one word. For example, a 1-word instruction would become 
a 2-word instruction or a 2-word instruction would become a 3-word instruc- 
tion. The addition of one word to an instruction affects its usability in delay 
slots. 


NT 
Note: 


Instructions using the *(Ik) form of absolute addressing cannot be used with 


repeat single instructions (RPT, RPTZ). 
ss) 
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5.3 Accumulator Addressing 
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Accumulator addressing uses the accumulator as an address. This address- 
ing mode is used to address program memory as data. 


Two instructions allow you to use the accumulator as an address: 


Lj READA Smem 
lL) WRITA Smem 


READA transfers a word from a program-memory location specified by accu- 
mulator A to a data-memory location specified by the single data-memory 
(Smem) operand of the instruction. 


WRITA transfers a word from a data-memory location specified by the Smem 
operand of the instruction to a program-memory location specified by accumu- 
lator A. 


In repeat mode, an increment may be used to increment accumulator A. 


Cc OOO 
Note: 


In most 54x devices, the program-memory location is specified by the lower 
16 bits of accumulator A. However, because the 548, 549, and 5410, have 
23 address lines (20 address lines in the 5402, and 18 in the 5420), the pro- 
gram-memory location in these devices is specified by the lower 23 bits of 
accumulator A. See section 3.2.5, Extended Program Memory, on 
page 3-20. 
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5.4 Direct Addressing 


In direct addressing mode, the instruction contains the lower seven bits of the 
data-memory address (dma). The 7-bit dma is an address offset that is com- 
bined with a base address, with the data-page pointer (DP), or with the stack 
pointer (SP) to form a 16-bit data-memory address. Using this form of addres- 
sing, you can access any of 128 locations in random order without changing 
the DP or the SP. 


aa | 


Note: 


Direct addressing is not the only method of offset addressing. However, the 
advantage of this mode is that it encodes each instruction and address into 
a single word. 


Either DP or SP can be combined with the dma offset to generate the actual 
address. The compiler mode bit (CPL), located in status register ST1, selects 
which method is used to generate the address: 


[1 WhenCPL =0, the dmafield is concatenated with the 9-bit DP field to form 
the 16-bit data-memory address. 


[1 When CPL = 1, the dma field is added (positive offset) to SP to form the 
16-bit data-memory address. 


The syntax for direct addressing uses a symbol or anumber to specify the off- 
set value. For example, to add the contents of the memory location SAMPLE 
to accumulator B, provided that the correct base address is in DP (CPL = 0) 
or SP (CPL = 1), you would write: 


ADD SAMPLE, B 


The lower seven bits of the address of SAMPLE are stored in the instruction 
word. 


Figure 5—3 shows the opcode format for instructions that use direct addressing. 
Table 5—2 describes the bits of the direct-addressing instruction. Figure 5—4 
illustrates how the 16-bit data address is formed. 
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Figure 5—3. Direct-Addressing Instruction Format 


15-8 7 6-0 


es 


Table 5—2. Direct-Addressing Instruction Bit Summary 


Bit Name Function 
15-8 Opcode This eight-bit field contains the operation code for the instruction. 
7 | | = 0, the addressing mode used by the instruction is the direct addressing mode. 


6-0 dma This seven-bit field contains the data-memory address offset for the instruction. 


Figure 5-4. Direct Addressing Block Diagram 


7 LSBs from IR (dma) 


SP(16) ® 


DAB(16) (read) 


CPL] cpL DAGEN 


0 EA=DP:offset(IR) | EAB(16) (write) 


1 EA=SP + offset(IR) ar 
CAB(16) 


(32-bit read) 


Data bus DB(16) 


Data bus EB(16) 


Legend: EA Effective address 
IR Instruction register 


Direct Addressing 


5.4.1. DP-Referenced Direct Addressing 


In DP-referenced direct addressing, the 7-bit dma in the instruction register is 
concatenated with the 9-bit DP to form the address. Figure 5—5 shows how the 
two values make up the resulting address. 


Figure 5-5. DP-Referenced Direct Address 
1S 14 43 A a. hs Se Se 


Value from the DP Value from the IR (dma) 


DP-referenced direct addressing divides memory into 512 pages, because the 
DP’s range is from 0 to 511 (29-1). Each page has 128 addressable locations, 
because the dma ranges from 0 to 127 (27 — 1). 


In other words, the DP points to one of 512 possible 128-word data-memory 
pages; the dma points to the specific location within that page. The only differ- 
ence between an access to location 0 on page 1 and to location 0 on page 2 
is the value of the DP. 


The DP is loaded by the LD instruction. 


5.4.2 SP-Referenced Direct Addressing 


In SP-referenced direct addressing, the 7-bit dma in the instruction register is 
added as a positive offset to the SP to form the effective 16-bit data-memory 
address. Figure 5—6 shows how the two values combine to form the resulting 
address. 


Figure 5-6. SP-Referenced Direct Address 
15.14 13 12 11:10 9 8 7 6 S&S 4 3 2 1 6 


Value from the SP 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 =#0 
+ 0 0 0 0 0 0 0 0 0 Value from the IR (dma) 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 O 


Effective memory address 


The SP points to any address in memory. The dma points to the specific loca- 
tion on the page, allowing you to access a contiguous 128-word (27 — 1) block 
in memory from any base address. 

SP can also add or remove items from the stack. See section 5.7, Stack 
Addressing, for more information. 
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5.5 Indirect Addressing 


5.5.1 


In indirect addressing, any location in the 64K-word data space can be 
accessed via a 16-bit address contained in an auxiliary register. The 54x has 
eight 16-bit auxiliary registers (ARO—AR7). Indirect addressing is used mainly 
when there is a need to step through sequential locations in memory in fixed- 
size steps. 


When memory is addressed with indirect addressing, the auxiliary register and 
the address can be optionally modified by a decrement, an increment, an off- 
set, or an index. Special modes offer circular and bit-reversed addressing. A 
circular buffer size register (BK) is used with circular addressing. The ARO reg- 
ister is used for indexed and bit-reversed addressing modes in addition to 
being used to point to memory as the other auxiliary registers do. 


Indirect addressing is flexible enough not only to read or write a single 16-bit 
data operand from memory with one instruction, but also to access two data- 
memory locations with one instruction. Accesses of two data-memory loca- 
tions include reads of two independent memory locations, reads and writes of 
two consecutive memory locations, and a read of one memory location com- 
bined with a write to a memory location. 


Single-Operand Addressing 


Figure 5—7 shows the indirect-addressing instruction format for a single data- 
memory (Smem) operand. Table 5—3 describes the bits of the instruction. 


Figure 5—7. Indirect-Addressing Instruction Format for a Single Data-Memory Operand 


15-8 7 6-3 2-0 


Table 5—3. Indirect-Addressing Instruction Bit Summary — Single Data-Memory 


Operand 
Name Function 
Opcode This eight-bit field contains the operation code for the instruction. 


| 
MOD 


| = 1, the addressing mode used by the instruction is the indirect addressing mode. 


This 4-bit modification field defines the type of indirect addressing. section 5.5.3, Single- 
Operand Address Modifications, on page 5-13, describes the 16 ways to specify addres- 
sing types with the MOD field. 
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Table 5-3. Indirect-Addressing Instruction Bit Summary — Single Data-Memory 


Operand (Continued) 
Bit Name Function 
2-0 ARF This 3-bit auxiliary register field defines the auxiliary register used for addressing. ARF 


depends on the compatibility mode bit (CMPT) in status register ST1: 


CMPT=0 Standard mode. In standard mode, ARF always specifies the auxiliary regis- 
ter, regardless of the value in ARP. ARP is not updated. ARP must always be 
set to zero when the DSP is in this mode. 


CMPT=1 Compatibility mode. In compatibility mode, ARP selects the auxiliary register 
if ARF = 0. Otherwise, ARF selects the auxiliary register and the ARF value 
is loaded into ARP when the access is completed. *ARO in the assembly 
instruction indicates the auxiliary register selected by ARP in compatibility 
mode. 


ae | 


Note: 


In some cases, two data operands can be fetched at once. This requires a 
different instruction format. section 5.5.4, Dual-Operand Address Modifica- 
tions, on page 5-19, describes this format. 


| a) 


5.5.2 ARAU and Address-Generation Operation 
Two auxiliary register arithmetic units (ARAUO and ARAU1) operate on the 
contents of the auxiliary registers. The ARAUs perform unsigned, 16-bit 
auxiliary register arithmetic operations. Some addresses can be obtained by 
premodifying the auxiliary register. 
The auxiliary registers can be: 


(j Loaded with an immediate value using the STM instruction 


_j Loaded via the data bus by writing to the memory-mapped auxiliary 
registers 


Lj Modified by the indirect addressing field of any instruction that supports 
indirect addressing 


1 Modified by the modify auxiliary register (MAR) instruction 


Used as loop counters using the BANZ[D] instruction 
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Te | 


Note: 


Typically, STM or MVDK is used to load auxiliary registers. Both of these 
instructions allow the next instruction to use the new value in the register. 
Other instructions that load a new value into an AR produce a pipeline laten- 
cy. For further information on the pipeline and possible pipeline conflicts, see 
Chapter, 7 Pipeline. 


| 


Figure 5-8 shows the ARAUs used to generate an address in the indirect 
addressing mode using a single data-memory operand. As the figure shows, 
the main components used for address generation in indirect addressing are 
the auxiliary register arithmetic units (ARAUO and ARAU1) and the auxiliary 
registers (ARO—AR7). 


Figure 5-8. Indirect Addressing Block Diagram for a Single Data-Memory Operand 
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Data bus EB(16) 
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5.5.3 Single-Operand Address Modifications 


You can modify the addresses you use in instructions before or after they are 
accessed, or you can leave them unchanged. You can modify them by incre- 
menting or decrementing the address by 1, adding a 16-bit offset, or indexing 
with the value in ARO. These three types of action combined with taking the 
action either before or after the access, plus the ways of leaving the address 
unchanged make a total of 16 addressing types, each assigned to a value of 
MOD, the 4-bit modification field in the encoding of an instruction using indirect 


addressing. 


Table 5-4 lists the types of single data-memory operand addressing, along 
with the value of MOD, the assembler syntax, and the function for each type. 


Table 5—4. Indirect Addressing Types With a Single Data-Memory Operand 


MOD 
Field 


0000 (0) 
0001 (1) 


0010 (2) 


0011 (3) 


0100 (4) 


0101 (5) 


0110 (6) 


0111 (7) 


1000 (8) 


1001 (9) 


Operand 
Syntax 


*ARX 
*ARX— 


*ARX+ 


*+ARX 


*ARx—0B 


*ARx—0 


*ARx +0 


*ARx+0B 


*ARx—% 


*ARX—0% 


Function 
addr = ARx 


addr = ARx 
ARx = ARx — 1 


addr = ARx 
ARx = ARx + 1 


addr = ARx + 1 
ARx = ARx + 1 


addr = ARx 
ARx = B(ARx — ARO) 


addr = ARx 
ARx = ARx — ARO 


addr = ARx 
ARx = ARx + ARO 


addr = ARx 
ARx = B(ARx + ARO) 


addr = ARx 
ARx = circ(ARx — 1) 


addr = ARx 


ARx = circ(ARx — ARO) 


Descriptiont 
ARx contains the data-memory address. 


After access, the address in ARx is decremented.t 
After access, the address in ARx is incremented.+ 
The address in ARx is incremented before its use.+8# 


After access, ARO is subtracted from ARx with reverse 
carry (rc) propagation. 


After access, ARO is subtracted from ARx. 
After access, ARO is added to ARx. 


After access, ARO is added to ARx with reverse carry (rc) 
propagation. 


After access, the address in ARx is decremented with 
circular addressing.+ 


After access, ARO is subtracted from ARx with circular 
addressing. 


Tt ARx is used as the data-memory address unless otherwise specified. 

+ Increment/decrement value is 1 for 16-bit word access and 2 for 32-bit word access. 

§ This mode is not allowed in memory-mapped register addressing. 

{This mode is discussed in greater detail in section 5.2.4, *(/k) Addressing, on page 5-5. 
# This mode is allowed only for write accesses. 
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Table 5-4. Indirect Addressing Types With a Single Data-Memory Operand (Continued) 


MOD 
Field 
1010 (10) 
1011 (11) 


1100 (12) 


1101 (13) 


1110 (14) 


1111 (15) 


Operand 
Syntax 
*ARX+% 
*ARX+0% 


*ARx(Ik) 


*4ARx(Ik) 


*LARX(Ik)% 


*(Ik) 


Function Descriptiont 

addr = ARx After access, the address in ARx is incremented with 

ARx = circ(ARx + 1) circular addressing.+ 

addr = ARx After access, ARO is added to ARx with circular addressing. 

ARx = circ(ARx + ARO) 

addr = ARx + Ik The sum of ARx and the 16-bit long offset (Ik) is used as 

ARx = ARx the data-memory address. ARx is not updated. 

addr = ARx + Ik The address in ARx is incremented before its use and 

ARx = ARx + Ik added to the signed 16-bit long offset (Ik). It is then used 
as the data-memory address.§ 

addr = circ(ARx + Ik) The address in ARx is incremented before its use and 

ARx = circ(ARx + Ik) added to a signed 16-bit long offset (Ik) with circular ad- 


dressing. It is then used as the data-memory address.$ 


addr = Ik An unsigned 16-bit long offset (Ik) is used as the absolute 
address of data memory (absolute addressing).§4 


Tt ARx is used as the data-memory address unless otherwise specified. 

+ Increment/decrement value is 1 for 16-bit word access and 2 for 32-bit word access. 

§ This mode is not allowed in memory-mapped register addressing. 

{This mode is discussed in greater detail in section 5.2.4, *(/k) Addressing, on page 5-5. 
# This mode is allowed only for write accesses. 


5.5.3.1 


Increment/Decrement Address Modifications (MOD = 0, 1, 2, or 3) 


While an AR is being used, you can modify the AR by incrementing or decre- 
menting its value. 


The syntaxes for using the AR without modification, postdecrementing the AR 
by 1, postincrementing the AR by 1, and preincrementing the AR by 1 are 
shown in Table 5—4 for MOD = 0, 1, 2, and 3, respectively. 


Preincrementing (*+ARx) is supported only in instructions that access oper- 
ands in a write operation. 


5.5.3.2 Offset Address Modifications (MOD = 12 or 13) 
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Offset addressing is a type of indirect addressing in which a predetermined off- 
set, or step size, is added to the contents of an auxiliary register. There are two 
options for offset addressing. In both cases, a 16-bit long offset, which is part 
of the instruction, is added to the value in the auxiliary register, and the result 
is used to address a location in data memory. In the first case, the auxiliary reg- 
ister is not updated. In the second case, the auxiliary register is updated with 
the new address. 


Indirect Addressing 


This type of addressing is useful in accessing a specific element of an array 
or structure, especially when the auxiliary register is not updated. When the 
auxiliary register is updated, this type of addressing is especially useful for 
stepping through an array in fixed-size steps. 


The syntaxes for offset addressing of an AR without and with updating the AR 
using offset addressing are shown in Table 5—4 in MOD 12 and 13, respectively. 


aS SSS—S——eSeSSNSNSNSESEEeeeeeeeeeeeeeeeeeeeeeeeeeeeEeEeEeEeEeEeEeEeEeEeEeEeEeEyeEeEeEeEeEeEoEoEOoEeeeeeeEeEeEeESESEeEeEeEeEeEeE=E=E=eeSSsSasasSsS——rt 
Notes: 


1) Instructions using offset addressing cannot be repeated using the repeat 
single instruction. 


2) Premodification by a 16-bit word offset (*+ARx(Ik)) uses an extra cycle 
because the instruction code has two or three words. The last word is the 


offset. 
Nee 


5.5.3.3. Indexed Address Modifications (MOD = 5 or 6) 


Indexed addressing is a type of indirect addressing in which the contents of 
ARO are added to, or subtracted from, any other auxiliary register, ARx. 
Indexed addressing differs from offset addressing in that the index or step size 
can be determined during code execution. Because the index is determined 
during code execution, you can easily make adjustments to the step size. 
Indexed addressing also offers an advantage over offset addressing: it does 
not require an additional word for the instruction. 


The syntaxes for subtracting ARO from ARx and for adding ARO to ARx are 
shown in Table 5—4 for MOD = 5 and 6, respectively. 


5.5.3.4 Circular Address Modifications (MOD = 8, 9, 10, 11, or 14) 


Many algorithms, such as convolution, correlation, and FIR filters, require the 
implementation of a circular buffer in memory. In these algorithms, a circular 
buffer is a sliding window containing the most recent data. As new data comes 
in, the buffer overwrites the oldest data. The key to the implementation of a 
circular buffer is the implementation of circular addressing. 
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The circular-buffer size register (BK) specifies the size of the circular buffer. 
A circular buffer of size R must start on a N-bit boundary (that is, the N LSBs 
of the base address of the circular buffer must be 0), where N is the smallest 
integer that satisfies 2N > R. The value R must be loaded into BK. For example, 
a 31-word circular buffer must start at an address whose five LSBs are 0 (that 
is, XXXX XXXX XXXO0 00009), and the value 31 must be loaded into BK. As 
a second example, a 32-word circular buffer must start at an address whose 
six LSBs are 0 (that is, XXXX XXXX XX00 0000s), and the value 32 must be 
loaded into BK. In some applications, however, it may be possible to use bit- 
reversed addressing to place a 2N buffer on a 2N boundary and offer the effect 
of circular addressing. 


The effective base address (EFB) of the circular buffer is determined by zero- 
ing the N LSBs of a user-selected auxiliary register (ARx). The end of buffer 
address (EOB) of the circular buffer is determined by replacing the N LSBs of 
ARx with the N LSBs of BK. The index of the circular buffer is simply the NLSBs 
of ARx and the stepis the quantity being added to or subtracted from the auxil- 
iary register. Follow these three rules when you use circular addressing: 


C) Place the first (lowest) address of the circular buffer on a 2N boundary 
where 2N is larger than the circular buffer size. 


1 Use a step less than or equal to the circular buffer size. 


Lj The first time the circular queue is addressed, the auxiliary register must 
point to an element in the circular queue. 


The algorithm for circular addressing is as follows: 


IfO < index + step < BK: 
index = index + step. 

Else if index + step = BK: 
index = index + step — BK. 

Else if index + step < 0: 
index = index + step + BK. 


Circular addressing can be used for single data-memory or dual data-memory 
operands. When BK is zero, the circular modifier results in no circular address 
modification. This is especially useful when a dual operand must perform an 
address modification equivalent to ARx+0. 


Figure 5—9 illustrates the relationships among BK, the auxiliary register (ARx), 
the bottom of the circular buffer, the top of the circular buffer, and the index into 
the circular buffer. 


Figure 5-10 shows how the circular buffer is implemented and illustrates the 
relationship between the generated values and the elements in the circular 
buffer. 
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Figure 5-9. Circular Addressing Block Diagram 
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Figure 5—10. Circular Buffer Implementation 
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Circular addressing typically uses a decrement or an increment by one 
(MOD = 8 and 10) or a decrement or an increment by an index (MOD = 9 and 
11). Premodification by a 16-bit word offset (*+ARx(Ik)%) requires an extra 
code word so that the instruction code has two or three words. The last word 
is the offset. An instruction using indirect-offset addressing cannot be repeated 
using a single repeat operation. 


The syntaxes for each of the five types of circular addressing are shown in 
Table 5—4 for MOD = 8, 9, 10, 11, and 14. 


5.5.3.5 Bit-Reversed Address Modifications (MOD = 4 or 7) 


Bit-reversed addressing enhances execution speed and program memory for 
FFT algorithms that use a variety of radixes. In this addressing mode, ARO 
specifies one half of the size of the FFT. The value contained in ARO must be 
equal to 2N-1, where N is an integer, and the FFT size is 2N. An auxiliary regis- 
ter points to the physical location of a data value. When you add ARO to the 
auxiliary register using bit-reversed addressing, the address is generated in 
a bit-reversed fashion, with the carry bit propagating from left to right, instead 
of the normal right to left. 


The syntaxes for each of the two bit-reversed addressing modes are shown 
in Table 5-4 for MOD 4 and 7, respectively. 


Assume that the auxiliary registers are eight bits long, that AR2 represents the 
base address of the data in memory (011000005), and that ARO contains the 
value 000010005. Example 5-1 shows a sequence of modifications of AR2 
and the resulting values of AR2. 


Example 5—1. Sequence of Auxiliary Registers Modifications in Bit-Reversed Addressing 


*AR2+0B 7 AR2 = 0110 0000 (Oth value) 
*AR2+0B ,AR2 = 0110 1000 (lst value) 
*AR2+0B 7 AR2 = 0110 0100 (2nd value) 
*AR2+0B 7 AR2 = 0110 1100 (3rd value) 
*AR2+0B ;AR2 = 0110 0010 (4th value) 
*AR2+0B ;AR2 = 0110 1010 (5th value) 
*AR2+0B ;AR2 = 0110 0110 (6th value) 
*AR2+0B ;AR2 = 0110 1110 (7th value) 


Table 5-5 shows the relationship of the bit pattern of the index steps and the 
four LSBs of AR2, which contain the bit-reversed address. 


See the 7MS320C54x DSP Reference Set, Volume 4: Applications Guide for 
an application of the bit-reversed addressing mode. 
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Table 5-5. Bit-Reversed Addresses 


Step Bit Pattern Bit-Reversed Pattern Bit-Reversed Step 
0 0000 0000 0 
1 0001 1000 8 
2 0010 0100 4 
3 0011 1100 12 
4 0100 0010 2 
5 0101 1010 10 
6 0110 0110 6 
7 0111 1110 14 
8 1000 0001 i 
9 1001 1001 9 
10 1010 0101 
11 1011 1101 13 
12 1100 0011 3 
13 1101 1011 11 
14 1110 0111 7 
15 1114 1111 15 


5.5.4 Dual-Operand Address Modifications 


Dual data-memory operand addressing is used for instructions that perform 
two reads or a single read and a parallel store (indicated by two vertical bars, 
||) at the same time. These instructions are all one word long and operate in 
indirect addressing mode only. Two data-memory operands are represented 
by Xmem and Ymem: 


[1 Xmem is a read operand with access through the D bus. Store instruc- 
tions, for example STH and STL with shift operation, change Xmem to a 
write operand. 


Lj Ymemis used as a read operand in instructions with dual reads (accessed 
through the C bus) or as a write operand in instructions with a parallel store 
(accessed through the E bus). 


If the source operand and the destination operand point to the same location, 
in instructions with a parallel store (for example, ST||LD), the source is read 
before writing to the destination. If a dual-operand instruction (for example, 
ADD) points to the same auxiliary register with different addressing modes 
specified for both operands, the mode defined by the Xmod field is used for 
addressing. 
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Figure 5—11 shows the indirect-addressing instruction format for a dual data- 
memory operand. Table 5-6 describes the bits of the instruction. 


Because only two bits are available for selecting each auxiliary register in this 
mode, only four of the auxiliary registers can be used, AR2 — AR5. Table 5—7 
shows which Xar or Yar value selects which auxiliary registers. 


Figure 5—11.Indirect-Addressing Instruction Format for Dual Data-Memory Operands 
15-8 7 


6 5 4 3 2 1 0 


Table 5—6. Indirect-Addressing Instruction Bit Summary — Dual Data-Memory Operands 


Bit Name Function 


15-8 Opcode This eight-bit field contains the operation code for the instruction. 


7-6 Xmod This 2-bit field defines the type of indirect addressing mode used for accessing the Xmem 
operand. 

5-4 Xar The 2-bit Xmem auxiliary register selection field defines the auxiliary register that contains 
the address of Xmem. 

3-2 Ymod This 2-bit field defines the type of indirect addressing mode used for accessing the Ymem 
operand. 

1-0 Yar The 2-bit Ymem auxiliary register selection field defines the auxiliary register that contains 


the address of Ymem. 


Table 5—7. Auxiliary Registers Selected by Xar and Yar Field of Instruction 


Xar or Yar 
Field Auxiliary Register 
00 AR2 
01 AR3 
10 AR4 
11 AR5 


Figure 5-12 shows how an address is generated using dual data-memory 
operand addressing. 

Dual data-memory operand addressing uses four auxiliary registers (AR2—ARS5). 
The ARAUs, together with these registers, provide the capability to access two 
operands in a single cycle. 
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Figure 5-12. Indirect Addressing Block Diagram for Dual Data-Memory Operands 
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Table 5-8 lists the types of dual data-memory operand addressing, along with 
the value of the modification field (either Xmod or Ymod), the assembler 
syntax, and the function for each type. 


Table 5-8. Indirect Addressing Types With Dual Data-Memory Operands 


Xmod or Operand 


Ymod Field Syntax Function Descriptiont 
00 (0) *ARX addr = ARx ARx is the data-memory address. 
01 (1) *ARx— addr = ARx After access, the address in ARx is decrem- 
ARx = ARx — 1 ented. 
10 (2) *ARX+ addr = ARx After access, the address in ARx is incremented. 
ARx = ARx + 1 
11 (3) *ARx+0% addr = ARx After access, ARO is added to ARx using circular 


ARx = circ(ARx + ARO) addressing.+ 


Tt ARx is used as the data-memory address unless otherwise specified. 
+ The size of the circular buffer is specified in circular-buffer size register (BK) 
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In each case, the content of the auxiliary register is used as the data-memory 
operand. After using the address in the auxiliary register, the ARAUs perform 
the specified mathematical operation. By disabling circular modifications, it is 
possible to perform indexed addressing or the equivalent of *~ARx+0. Clearing 
the BK to 0 disables circular modification. 


In instructions that perform dual-operand reads, if the auxiliary 
register specified by the Yar field accesses one of the 
memory-mapped registers, the value read will not represent the 


contents of the register. 


See the TMS320C54x DSP Reference Set, Volume 4: Applications Guide for 
examples of dual-operand indirect addressing. 


5.5.4.1 Dual-Operand Increment/Decrement Address Modifications (Xmod or Ymod = 0, 1, or 2) 


You can modify the AR by incrementing or decrementing its value. When 
Xmod or Ymod = 0, ARx is used as the data-memory address with no incre- 
menting or decrementing. When Xmod or Ymod = 1, ARx is decremented after 
the access is made. When Xmod or Ymod = 2, ARx is incremented after the 
access is made. 


5.5.4.2 Dual-Operand Indexed Address Modifications (Xmod or Ymod = 3 and BK = 0) 


When Xmod or Ymod = 3 and BK = 0, ARO is added to ARx after each access. 
Otherwise, dual-operand indexed addressing is exactly as described in sec- 
tion 5.5.3.3 on page 5-15. 


5.5.4.3 Dual-Operand Circular Address Modifications (Xmod or Ymod = 3 and BK + 0) 


When Xmod or Ymod = 3 and BK + 0, ARO is added to ARx using circular 
addressing after each access. Otherwise, dual-operand circular addressing is 
exactly as described in section 5.5.3.4 on page 5-15. 


5.5.4.4 Single-Operand Instructions That Use the Dual-Operand Format 


5-22 


Some instructions with only one data-memory operand use dual data-memory 
operand addressing so that they fit in a single word for single-cycle execution. 
In these instructions, only Xmem is available and the Xmod and Xar fields 
define the addressing mode for the operand. Four single-operand instructions 
can be executed in a single cycle: 


Lj) BIT Xmem, BITC 

1 SACCD src, Xmem, cond 
Li SRCCD Xmem, cond 

Li STRCD Xmem, cond 
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Five instructions with optional shift also support this type of addressing for 


single-word, single-cycle execution: 


ADD Xmem, SHFT, src 
LD Xmem, SHFT, dst 

STH sre, SHFT, Xmem 
STL sre, SHFT, Xmem 
SUB Xmem, SHFT, src 


OOUUOU 


5.5.5 TMS320C2x/C20x/C24x/C5x Compatibility (ARP) Mode 


ARP can be used in indirect addressing. This allows the AR to be defined by 
ARP to ease code translation from a ’C2x/C20x/C24x/C5x device. With CMPT 
= 1 and ARF = 0, ARP is used to determine which AR is used to address 
memory. Figure 5—13 shows how the ARP indexes the auxiliary registers. 


In using ARP, the ’54x differs from the ’C5x in that when the ’54x uses the AR 
pointed to by ARP, the 54x does not update the ARP with the same instruction. 
Table 5-9 shows the assembler syntax for the ’C2x/C20x/C24x/C5x 


compared to the ’54x. 


Figure 5-13. How ARP Indexes the Auxiliary Registers 
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Syntax for Syntax for Syntax for Syntax for 

*C2x/C20x/C24x/C5x 54x *C2x/C20x/C24x/C5x 54x 

a! *ARO *Q- *ARO-0 

* *ARO- *O+ *ARO+0 

*y *ARO+ *BRO- *ARO-—0B 
*BRO+ *ARO+0B 
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Figure 5-14 shows the indirect-addressing instruction format for the ARP 
mode. Table 5-10 describes the bits of the ARP-mode instruction. 


Figure 5—14. Indirect-Addressing Instruction Format for Compatibility Mode 


15-8 7 6-3 2-0 


Table 5-10. Indirect-Addressing Instruction Bit Summary — Compatibility Mode 


Bit Name Function 


15-8 Opcode This eight-bit field contains the operation code for the instruction. 
7 | | = 1, the addressing mode used by the instruction is the indirect addressing mode. 


6-3 MOD This 4-bit modification field defines the type of indirect addressing. section 5.5.3, Single- 
Operand Address Modifications, on page 5-13, describes the 16 ways to specify addres- 
sing types with the MOD field. 


2-0 ARF This 3-bit auxiliary register field defines the auxiliary register used for addressing. ARF 
depends on the compatibility mode bit (CMPT) in status register ST1: 


CMPT=0 Standard mode. In standard mode, ARF always specifies the auxiliary regis- 
ter, regardless of the value in ARP. ARP is not updated. ARP must always be 
set to 0 when the DSP is in this mode. 


CMPT=1 Compatibility mode. In compatibility mode, ARP selects the auxiliary register 
if ARF = 0. Otherwise, ARF selects the auxiliary register and the ARF value 
is loaded into ARP when the access is completed. “ARO in the assembly 
instruction indicates the auxiliary register selected by ARP in compatibility 
mode. 


TS 
Note: 


ARP must always be set to 0 when the DSP is in standard mode (CMPT = 0). 


At reset, both ARP and CMPT are set to 0 automatically. 
a) 
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Memory-Mapped Register Addressing 


5.6 Memory-Mapped Register Addressing 


Memory-mapped register addressing is used to modify the memory-mapped 
registers without affecting either the current data-page pointer (DP) value or 
the current stack-pointer (SP) value. Because DP and SP do not need to be 
modified in this mode, the overhead for writing to a register is minimal. 
Memory-mapped register addressing works for both direct and indirect 
addressing. 


Figure 5-15 shows how memory-mapped addresses are generated. 
Addresses are generated by: 


(1 Forcing the nine most significant bits (MSBs) of data-memory address to 
0, regardless of the current value of DP or SP when direct addressing is 
used 


[1 Using the seven LSBs of the current auxiliary register value when indirect 
addressing is used 


Note: 


In indirect addressing, the nine MSBs of the auxiliary register are forced to 
O after the operation. 


For example, if AR1 is used to point to a memory-mapped register in memory- 
mapped register addressing mode and it contains a value of FF25h, then AR1 
points to the timer period register (PRD), since the seven LSBs of AR1 are 25h 
and the address of the PRD is 0025h. After execution, the value remaining in 
AR1 is 0025h. 


Figure 5-15. Memory-Mapped Register Addressing Block Diagram 


All bits Os 


7 LSBs from instruction register (IR) 
or current auxiliary register 


16-bit memory-mapped register address 


Note: 


In addition to registers, any scratch-pad RAM located on data page 0 can be 
modified by using memory-mapped register addressing. 
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Memory-Mapped Register Addressing 


Only eight instructions can use memory-mapped register addressing: 


LDM MMR, dst 
MVDM dmad, MMR 
MVMD MMR, dmad 
MVMM MMRx, MMRy 
POPM MMR 

PSHM MMR 

STLM src, MMR 
STM #/k, MMR 


OOUUOUOUULU 


Note: 


The following indirect addressing modes are not allowed for memory- 
mapped register addressing: 


 *ARx(Ik) 
 *+ARx(Ik) 
 *+ARx(Ik)% 
Ly *(Ik) 


In these cases, the assembler issues a warning. 


Stack Addressing 


5.7 Stack Addressing 


The system stack is used to automatically store the program counter during 
interrupts and subroutines. It can also be used at your discretion to store 
additional items of context or to pass data values. The stack is filled from the 
highest to the lowest memory address. The processor uses a 16-bit memory- 
mapped register, the stack pointer (SP), to address the stack. SP always 
points to the last element stored onto the stack. 


Four instructions access the stack using the stack addressing mode: 


(1 PSHD pushes a data-memory value onto the stack. 

[1 PSHM pushes a memory-mapped register onto the stack. 
Lj) POPD pops a data-memory value from the stack. 

1 POPM pops a memory-mapped register from the stack. 


A push predecrements and a pop postincrements the address in the SP. 
Figure 5-16 shows an example of the stack and SP before and after a push 
of X2 into the stack (PSHD X2). 


Figure 5—16. Stack and Stack Pointer Before and After a Push Operation 


Stack and SP before operation Stack and SP after operation 


SP 0011 


0001 [—] se [00] 00 
0010 oo%0 [x 


Other operations also affect the stack and the stack pointer. The stack is used 
during interrupts and subroutines to save and restore the PC contents. When 
a subroutine is called or an interrupt occurs, the return address is automatically 
saved in the stack using a push operation. Instructions used for subroutine 
calls and interrupts are CALA[D], CALL[D], CC[D], INTR, and TRAP. 


When asubroutine returns, the return address is retrieved from the stack using 
a pop operation and loaded into the PC. Instructions used for returns from sub- 
routines are RET[D], RETE[D], RETEF[D], and RC[D]. 


The FRAME instruction also affects the stack. This instruction adds a short- 
immediate offset to the stack pointer. The stack is also used in SP-referenced 
direct addressing (see section 5.4.2, SP-Referenced Direct Addressing, on 
page 5-9). 
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5.8 Data Types 


There are two basic data types for accessing memory in the 54x: 16-bit and 
32-bit. Most instructions can access 16-bit data. Accessing 32-bit data, how- 
ever, requires the use of the special instructions listed in Table 5-11. 


Table 5-11. Instructions With 32-Bit Word Operands 


Instruction Description 


DADD Double-precision add/dual 16-bit add to accumulator 

DADST Double-precision load with T add/dual 16-bit load with T add/subtract 
DLD Long-word load to accumulator 

DRSUB Double-precision subtract/dual 16-bit subtract from long word 
DSADT Long load with T subtract/dual 16-bit load with T subtract/add 

DST Store accumulator in long word 

DSUB Double-precision subtract/dual 16-bit subtract from accumulator 
DSUBT Long load with T subtract/dual 16-bit load with T subtract 


For a 16-bit operand access, a 16-bit word is read from data memory through 
the D bus and written to data memory through the E bus. For a 32-bit operand 
access, both the C (for most-significant word) and the D (for least-significant 
word) buses are used for a read. However, because only the E bus is used for 
a write, the write operation (DST instruction) is executed in two cycles. 


With 32-bit accesses, the first word accessed is treated as the most-significant 
word (MSW), while the second word accessed is the least-significant word 
(LSW). If the first word accessed is at an even address, then the second word 
is at the next (higher) address. If the first word accessed is at an odd address, 
then the second word is at the previous (lower) address. Figure 5-17 shows 
this effect. 


Data Types 


Figure 5—17. Word Order in Memory 


Accumulator MSW LSW 
DST 1000h DST 1001h 
Write order Write order 
when addr = 1000h when addr = 1001h 
1000h MSW 1000h LSW 
1001h LSW 1001h MSW 
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Chapter 6 


Program Memory Addressing 


This chapter discusses how program-memory addresses are generated and 
which addresses are loaded into the program counter (PC). This chapter also 
describes the following program control operations that affect the value loaded 
in the PC: 


Branches 

Calls 

Returns 

Conditional operations 

Repeats of an instruction or a block of instructions 
Hardware reset 

Interrupts 


CUUOUCUL 


These operations can cause a nonsequential address to be loaded into PC. 
Section 7.1, Pipeline Operation, and section 7.2, Interrupts and the Pipeline, 
are helpful in understanding the operation of PC discontinuities. Power-down 
modes halt program execution. 
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Program-Memory Address Generation 


6.1 Program-Memory Address Generation 


Program memory contains code for applications, coefficient tables, and im- 
mediate operands. The ’54x can address a total of 64K words of program 
memory using the program address bus (PAB). Table 6—1 shows devices that 
have additional program memory address lines that provide external access 
to as many as 128 64K-word pages. 


Table 6—1. Devices With Additional Program Memory Address Lines 
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Additional Provides 
Device Address Lines External Access To: 
548, 549, v 128 64K-Word Pages 
5410 
5402 4 16 64K-Word Pages 
5420 2 4 64K-Word Pages 


The program-address generation logic (PAGEN) generates the address used 
to access instructions, coefficient tables, 16-bit immediate operands, or other 
information stored in program memory, and puts this address on the PAB. 


PAGEN consists of five registers (see Figure 6—1): 


Program counter (PC) 

Repeat counter (RC) 

Block-repeat counter (BRC) 

Block-repeat start address register (RSA) 
Block-repeat end address register (REA) 


UOUUOU 


One additional register is used in the 548, ’549, 5402, ’5410, and ’5420 to ad- 
dress extended memory: 


(1 Program counter extension register (XPC) 


Program-Memory Address Generation 


Figure 6—1. Program-Address Generation Logic (PAGEN) Registers 


PAGEN 


54x devices fetch instructions by putting the value of the PC on the PAB and 
reading the appropriate location in memory. While the memory location is read, 
PC is incremented for the next fetch. If aprogram address discontinuity occurs 
(for example, a branch, a call, a return, an interrupt, or a block repeat), the 
appropriate address is loaded into the PC. The instruction addressed through 
the PAB is then loaded into the instruction register (IR). 


To improve the performance of certain instructions, the program address gen- 
eration unit is also used to fetch operands from program memory. Operands 
are fetched from program memory when the device reads from or writes to a 
coefficient table or when it transfers data between program and data space. 
Some instructions, such as FIRS, MACD, and MACP, use the program bus to 
fetch a second multiplicand. 
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Program Counter (PC) 


6.2 Program Counter (PC) 


The PC is a 16-bit register that contains the internal or external program- 
memory address used when an instruction is fetched or when a 16-bit-immedi- 
ate operand or coefficient table in program memory is accessed. To address 
program memory, the address in the PC is put onto the PAB. 


The PC can be loaded several ways. Table 6-2 shows what is loaded into the 
PC according to the code operation performed. 


Table 6-2. Loading Addresses Into PC 


Code Operation 


Reset 
Sequential execution 


Branch 


Branch from accumulator 


Block repeat loop 


Subroutine call 


Subroutine call from 
accumulator 


Hardware interrupt, soft- 
ware interrupt, or trap 


Address Loaded to the PC 
PC is loaded with FF80h. 


PC is loaded with PC + 1. 


PC is loaded with the 16-bit-immediate value directly 
following the branch instruction. 


PC is loaded with the lower 16-bit word of accumulator 
AorB. 


PC is loaded with the repeat start address (RSA) when 
PC + 1 equals the repeat end address (REA) + 1, 
provided that BRAF = 1. 


PC + 2 is pushed onto the stack, and PC is loaded with 
the 16-bit-immediate value directly following the call 
instruction mnemonic. The return instruction pops the 
top of the stack back into PC to return to the calling 
sequence of code. 


PC + 1 is pushed onto the stack, and PC is loaded with 
the lower 16-bit word of accumulator A or B. The return 
instruction pops the top of the stack back into PC to 
return to the calling sequence of code. 


PC is pushed onto the stack, and PC is loaded with the 
address of the appropriate trap vector. The return 
instruction pops the top of the stack back into PC to 
return to the interrupting sequence of code. 


Program Counter (PC) 


The XPC is a 7-bit register that selects the extended page of program memory 
for the 548, ’549, ’5402, 5410, and ’5420. For more information about ex- 
tended program memory in these devices, see section 3.2.5, Extended Pro- 
gram Memory, on page 3-20. 


The XPC can be loaded in several ways in conjunction with the loading of the 
PC. Table 6-3 lists operations that load XPC. 


Table 6-3. Loading Addresses into XPC 


Code Operation 


Reset 


Sequential execution 


Far branch 


Far branch from accu- 
mulator 


Far subroutine call 


Far subroutine call from 
accumulator 


Far return 


Address Loaded to the PC 
PC is loaded with FF80h. XPC is loaded with Oh. 


PC is loaded with PC + 1. XPC is not automatically 
incremented. 


PC is loaded with bits 15—0 of the immediate value direct- 
ly following the branch instruction. XPC is loaded bits 
23-16 of that value. 


PC is loaded with bits 15-0 of accumulator A or B. XPC 
is loaded with bits 23-16 of accumulator A or B. 


PC +2 is pushed onto the stack, XPC is pushed onto the 
stack, PC and XPC are loaded with bits 15-0 and bits 
23-16, respectively, of the immediate value specified by 
the call instruction. 


PC +1 is pushed onto the stack, XPC is pushed onto the 
stack, and the PC and XPC are loaded with bits 15-0 
and bits 23-16, respectively, of accumulator A or B. 


The return instruction pops the top of the stack into XPC 
and pops the next value into the PC to return to the call- 
ing sequence of code. 


We 


Note: 


The XPC is not loaded by instructions other than those listed in Table 6-3. 


a 


Program Memory Addressing 6-5 


Branches 


6.3 Branches 


Branches break the sequential flow of instructions by transferring control to 
another location in program memory. Therefore, branches affect the program 
address generated and stored in PC. The 54x performs both unconditional 
and conditional branches, and both of these types can be either nondelayed 
or delayed. 


6.3.1. Unconditional Branches 
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An unconditional branch is always executed when it is encountered. During 
the execution, PC is loaded with the specified branch-to-program-memory 
address and execution of the new section of code begins at that address. The 
address loaded into PC comes from either the second word of the branch 
instruction or the lower 16 bits of an accumulator (accumulator A or accumula- 
tor B). 


By the time the branch instruction reaches the execute phase of the pipeline, 
the next two instruction words have already been fetched. How these two 
instruction words are handled depends in part on whether the branch is non- 
delayed or delayed: 


(1 Nondelayed: The two instruction words are flushed from the pipeline so 
that they are not executed, and then execution continues at the branched- 
to address. 


Lj Delayed: The one 2-word instruction or two 1-word instructions following 
the branch instruction are executed. This allows you to avoid flushing the 
pipeline, which requires extra cycles. 


—  ——— a “aaa 000 0 qa aaaaaaa—ac——uaoQ“qq—————M—MMMG<G“<“GGG“——<—_—n 
Note: 


The two words following a delayed instruction cannot be an instruction that 
causes a PC discontinuity (a branch, call, return, or software interrupt). 


Table 6—4 shows the unconditional branch instructions in the ‘54x and the 
number of cycles needed to execute these instructions (both nondelayed and 
delayed). Delayed instructions use two cycles fewer than the corresponding 
nondelayed instructions because they do not flush the pipeline. 


Branches 


Table 6-4. Unconditional Branch Instructions 


Number of Cycles 


Instruction Description (Nondelayed / Delayed) 
B[D] Load PC with the address specified by 4/2 


the instruction 


BACC[D] Load PC with the address specified by 6/4 
the low 16 bits of the designated 
accumulator 


6.3.2 Conditional Branches 


Conditional branches operate like unconditional branches, but they execute 
only when one or more user-specified conditions are met. The possible condi- 
tions are given in Table 6-13 on page 6-16. If all the conditions are met, PC 
is loaded with the second word of the branch instruction, which contains the 
address to branch to, and execution continues at this address. 


By the time the conditions have been tested, the two instruction words follow- 
ing the conditional branch instruction have already been fetched and are in the 
pipeline. How these two instruction words are handled depends in part on 
whether the branch is nondelayed or delayed: 


Lj Nondelayed: If all the conditions are met, these two instruction words are 
flushed from the pipeline so that they are not executed, and then execution 
continues at the branched-to address. If the conditions are not met, the 
two instruction words are executed instead of the branch. 


Lj Delayed: The one 2-word instruction or two 1-word instructions following 
the branch instruction are executed. This allows you to avoid flushing the 
pipeline, which requires extra cycles. The conditions tested are not 
affected by the instructions following the delayed branch. 


Note: 


The two words following a delayed instruction cannot be an instruction that 
causes a PC discontinuity (a branch, call, return, or software interrupt). 


Table 6—5 shows the conditional branch instructions and the number of cycles 
needed to execute these instructions. Because conditional branches use 
conditions determined by the execution of the previous instructions, the condi- 
tional branch instruction, BC[D], requires one more cycle than an uncondi- 
tional one. 
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Branches 


Table 6-5. Conditional Branch Instructions 


Number of Cycles 
(Condition met / Not met) 


Instruction Description Nondelayed Delayed 


BC[D] Load PC with the address speci- 5/3 3/3 
fied by the instruction if the condi- 
tion specified by the instruction is 
met 


BANZ[D] Load PC with the address speci- 4/2 2/2 
fied by the instruction if currently 
selected auxiliary register not 
equal to 0 (useful for loops) 


6.3.3 Far Branches (Available on ’548/'549/'5402/'5410/’5420) 


To allow branches to extended memory, the 548, '549,'5402, 5410, and ’5420 
include two far branch instructions: 


_j FB[D] branches to the 23-bit address (20-bit addresses in the 5402 and 
18-bit addresses in the 5420) specified by the the instruction. 


(4 FBACC[D] branches to the 23-bit address (20-bit addresses in the ‘5402 
and 18-bit addresses in the 5420) specified in the designated accumula- 
tor. 


Table 6-6 shows the far branch instructions in the 548, 549, 5402, 5410, and 
5420 (both nondelayed and delayed) and the number of cycles needed to 
execute these instructions. Delayed instructions use two cycles fewer than the 
corresponding nondelayed instructions. 


Table 6-6. Far Branch Instructions 
Number of Cycles 
Instruction Description (Nondelayed / Delayed) 


FB[D] Load the PC and the XPC with the 4/2 
address specified in the instruction 


FBACC[D] Load the PC and the XPC with the 6/4 
address specified by the lower 23 bits 
of the designated accumulator 


6-8 


6.4 Calls 


Calls 


Like branches, calls break the sequential flow of instructions by transferring 
control to some other location in program memory. However, unlike branches, 
this transfer is intended to be temporary. When a subroutine or function is 
called, the address of the next instruction following the call is saved in the 
stack. This address is used to return to the calling program and resume execu- 
tion. The ’54x performs both unconditional and conditional calls, and both of 
these types can be either nondelayed or delayed. 


6.4.1. Unconditional Calls 


An unconditional call is always executed when it is encountered. When the call 
is executed, the PC is loaded with the specified program-memory address and 
execution of the called routine begins at that address. The address loaded into 
PC can come from either the second word of the call instruction or the lower 
16 bits of an accumulator (accumulator A or accumulator B). Before the PC is 
loaded, the return address is saved in the stack. After the subroutine or func- 
tion is executed, a return instruction loads the PC with the return address from 
the stack, and execution resumes at the instruction following the call. 


By the time the unconditional call instruction reaches the execute phase of the 
pipeline, the next two instruction words have already been fetched. How these 
two instruction words are handled depends in part on whether the call is non- 
delayed or delayed: 


(1 Nondelayed: The two instruction words are flushed from the pipeline so 
that they are not executed, the return address is stored to the stack, and 
then execution continues at the beginning of the called function. 


Lj Delayed: The one 2-word instruction or two 1-word instructions following 
the call instruction are executed. This allows you to avoid flushing the pipe- 
line, which requires extra cycles. 


Note: 


The two words following a delayed instruction cannot be an instruction that 


causes a PC discontinuity (a branch, call, return, or software interrupt). 
a) 


Table 6—7 shows the unconditional call instructions in the 54x (both nonde- 
layed and delayed) and the number of cycles needed to execute these instruc- 
tions. Delayed instructions need two cycles fewer than the corresponding non- 
delayed instructions because they do not flush the pipeline. 
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Table 6-7. Unconditional Call Instructions 


Number of Cycles 
Instruction Description (Nondelayed / Delayed) 


CALLID] Places the return address on the 4/2 
stack and then loads the PC with the 
address specified by the instruction 


CALAID] Places the return address on the 6/4 
stack and then loads the PC with the 
address specified in the designated 
accumulator 


6.4.2 Conditional Calls 


Conditional calls operate like unconditional calls, but they execute only when 
one or multiple conditions are met. The possible conditions are given in 
Table 6-13 on page 6-16. If all the conditions are met, the PC is loaded with 
the second word of the call instruction, which contains the starting address of 
the function to be called. Before branching to the called function, the processor 
stores the address of the instruction following the call instruction to the stack. 
The function must end with a return instruction, which takes the address off the 
stack and loads PC, allowing the processor to resume execution of the calling 
program. 


By the time the conditions of the conditional call instruction have been tested, 
the two instruction words following the call instruction have already been 
fetched in the pipeline. How these two instruction words are handled depends 
in part on whether the call is nondelayed or delayed: 


Lj Nondelayed: If all the conditions are met, these two instruction words are 
flushed from the pipeline so that they are not executed, and then execution 
continues at the beginning of the called function. If the conditions are not 
met, the two instructions are executed instead of the call. 


Lj Delayed: The one 2-word instruction or two 1-word instructions following 
the call instruction are always executed. This allows you to avoid flushing 
the pipeline, which requires extra cycles. The conditions tested are not 
affected by the instructions following the delayed call. If the conditions are 
not met, the processor executes the two instruction words instead of the 
call. 


Note: 


The two words following a delayed instruction cannot be an instruction that 
causes a PC discontinuity (a branch, call, return, or software interrupt). 


Calls 


Table 6—8 shows the conditional call instruction and the number of cycles 
needed to execute this instruction. Because there is a wait cycle for conditions 
to become stable, the conditional call instruction, CC[D], requires one more 
cycle than the unconditional one. 


Table 6-8. Conditional Call Instruction 


Number of Cycles 
(Condition met / Not met) 


Instruction Description Nondelayed Delayed 


CC[D] Places the return address on the 5/3 3/3 
stack and then loads the PC with 
the address specified by the in- 
struction if the condition specified 
by the instruction is met 


6.4.3 Far Calls (Available on 548 /’549/’5402/'5410/'5420) 


To allow calls to extended memory, the 548, 549, 5402, 5410, and 5420 in- 
clude two far call instructions: 


(1 The FCALL instruction pushes XPC onto the stack, pushes PC onto the 
stack, and branches to the 23-bit address (20-bit addresses in the 5402 
and 18-bit addresses in the 5420) specified by the the instruction. 


[1 The FCALA pushes XPC onto the stack, pushes PC onto the stack, and 
branches to the 23-bit address (20-bit addresses in the 5402 and 18-bit 
addresses in the 5420) specified in the designated accumulator. 


Table 6—9 shows the far call instructions in the 548, 549, 5402, 5410, and 
5420 (nondelayed and delayed) and the number of cycles needed to execute 
these instructions. Note that delayed instructions need two cycles fewer than 
the corresponding nondelayed instructions. 


Table 6-9. Far Call Instructions 


Number of Cycles 
Instruction Description (Nondelayed / Delayed) 


FCALLID] Places XPC and PC on the stack and 4/2 
then loads XPC and PC with the ad- 
dress specified by the instruction 


FCALA[D] Places XPC and PC on the stack and 6/4 
then loads XPC and PC with the ad- 
dress specified in the designated ac- 
cumulator 
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6.5 Returns 


Return instructions provide a way to resume processing of a sequence of 
instructions that was broken by a call to another function or an interrupt service 
routine. When the called function or interrupt service routine has completed 
its execution, it is necessary to resume processing at the point immediately 
following the call or the point at which the interrupt occurred. Return instruc- 
tions accomplish this by popping the top value of the stack, which contains the 
address of the next instruction to be executed, into the program counter (PC). 
The ’54x performs both unconditional and conditional returns, and both of 
these types can be either nondelayed or delayed. 


The '548, 549, 5402, 5410, and ’5420 offer an additional return instruction: 
an unconditional far return, both nondelayed and delayed. 


6.5.1. Unconditional Returns 
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An unconditional return is always executed when it is encountered. When the 
return is executed, PC is loaded with the return address from the stack and 
execution resumes at the instruction following the instruction that called the 
function or at the point at which the interrupt occurred. 


By the time the unconditional return instruction reaches the execute phase of 
the pipeline, the next two instruction words have already been fetched. How 
these two instruction words are handled depends in part on whether the return 
is nondelayed or delayed. 


(1 Nondelayed: The two instruction words are flushed from the pipeline so 
that they are not executed, the return address is taken from the stack or 
from the RTN register, and then execution continues at that address in the 
calling function. 


Lj Delayed: The one 2-word instruction or two 1-word instructions following 
the return instruction are executed. This lets you avoid flushing the pipe- 
line, which requires extra cycles. The return address is taken from the 
stack or from the RTN register. 


aaa, | 
Note: 


The two words following a delayed instruction cannot be an instruction that 
causes a PC discontinuity (a branch, call, return, or software interrupt). 


Table 6-10 shows the unconditional return instructions in the 54x (nonde- 
layed and delayed) and the number of cycles needed to execute these instruc- 
tions. Delayed instructions need two cycles fewer than the corresponding non- 
delayed instructions. 


Returns 


Table 6-10. Unconditional Return Instructions 


Number of Cycles 
Instruction Description (Nondelayed / Delayed) 


RET[D] Load the PC with the return ad- 5/3 
dress at the top of the stack 


RETE[D] Load the PC with the return ad- 5/3 
dress at the top of the stack, and 
enable maskable interrupts 


RETF[D] Load the PC with the return ad- 3/1 
dress in the RTN register, and en- 
able maskable interrupts 


Enabling interrupts with the RETE and RETF instructions ensures that the 
return executes before another interrupt is processed. By using the RETF 
instruction, loading the PC from the RTN register rather than the stack allows 
a quicker return. This reduces the total number of cycles used by an interrupt 
routine, which is particularly important for short, frequently used interrupt 
routines. 


aaa, | 
Note: 


The RTN register is a CPU-internal register that you cannot read from or write 


to. 
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6.5.2 Conditional Returns 


By using the conditional return (RC) instruction, you can give a function or 
interrupt service routine (ISR) more than one possible return path. The path 
chosen depends on the data being processed. In addition, you can use a 
conditional return to avoid conditionally branching to/around the return instruc- 
tion at the end of the function or ISR. 


Conditional returns operate like unconditional returns, but they execute only 
when one or more conditions are met. The possible conditions are given in 
Table 6-13 on page 6-16. If all the conditions are met, the processor loads the 
return address from the stack to PC, and resumes execution of the calling 
program. 


The conditional return is a single-word instruction; however, because of the 
potential PC discontinuity, it operates with the same effective execution time 
as the conditional branch or call. 
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Table 6-117. 
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By the time the conditions of the conditional return instruction have been 
tested, the two instruction words following the return instruction have already 
been fetched in the pipeline. How these two instruction words are handled 
depends in part on whether the return is nondelayed or delayed: 


Lj Nondelayed: If all the conditions are met, these two instruction words are 
flushed from the pipeline so that they are not executed, and then execution 
of the calling program continues. If the conditions are not met, the two 
instructions are executed instead of the return. 


(1 Delayed: The processor executes the two instructions that follow the 
return instruction. This allows you to avoid flushing the pipeline, which 
requires extra cycles. The conditions tested are not affected by the instruc- 
tions following the delayed return. 


Note: 


The two words following a delayed instruction cannot be an instruction that 


causes a PC discontinuity (a branch, call, return, or software interrupt). 
sss 


Table 6-11 shows the conditional return instruction and the number of cycles 
needed to execute this instruction. 


Conditional Return Instruction 


Number of Cycles 
(Condition met / Not met) 


Instruction Description Nondelayed Delayed 
RC[D] Load PC with the return address 5/3 3/3 
at the top of the stack if the condi- 


tion specified by the instruction is 
met 


Far Returns (Available on °548/°549/5402/5410/'5420) 


To allow returns from extended memory, the 548, 549, 5402, 5410, and 
5420 include two far-return instructions: 


1 FRET loads XPC from the stack and then loads PC from the stack, allow- 
ing program execution to resume at the previous point. 


[1 FRETE loads XPC from the stack, loads PC from the stack, and enables 
maskable interrupts. 


Returns 


Table 6-12 shows the far return instructions in the 548, ’549, 5402, 5410, and 
5420 (nondelayed and delayed) and the number of cycles needed to execute 
these instructions. Note that delayed instructions need two cycles fewer than 
the corresponding nondelayed instructions. 


Table 6-12. Far Return Instructions 


Number of Cycles 


Instruction Description (Nondelayed / Delayed) 
FRET[D] Loads XPC with the value at the top 6/4 


of the stack and loads PC with the 
next value on the stack 


FRETE[D] Loads XPC with the value at the top 6/4 
of the stack, loads PC with the next 
value on the stack, and enables 
maskable interrupts 
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6.6 Conditional Operations 


The ’54x includes instructions that execute only if one or more conditions are 
met. Table 6—13 lists the conditions that you can use with these instructions 
and their corresponding operand symbols. 


Table 6-13. Conditions for Conditional Instructions 


‘Condition —sDescription =—s—=<C—s~Ssé‘S;SSS*‘« pera: 
A=0 Accumulator A equal to 0 AEQ 
B=0 Accumulator B equal to 0 BEQ 
Az#0 Accumulator A not equal to 0 ANEQ 
Bz#0 Accumulator B not equal to 0 BNEQ 
A<0 Accumulator A less than 0 ALT 
B<0 Accumulator B less than 0 BLT 
A<s0 Accumulator A less than or equal to 0 ALEQ 
B<0 Accumulator B less than or equal to 0 BLEQ 
A>0O Accumulator A greater than 0 AGT 
B>0 Accumulator B greater than 0 BGT 
Az=0 Accumulator A greater than or equal to 0 AGEQ 
B20 Accumulator B greater than or equal to 0 BGEQ 
AOV = 1 Accumulator A overflow detected AOV 
BOV = 1 Accumulator B overflow detected BOV 
AOV =0 No accumulator A overflow detected ANOV 
BOV =0 No accumulator B overflow detected BNOV 
C=1 ALU carry set to 1 C 
C=0 ALU carry cleared to 0 NC 
TC =1 Test/control flag set to 1 TC 
TC =0 Test/control flag cleared to 0 NTC 
BIO low BIO signal is low BIO 
BIO high BIO signal is high NBIO 
none Unconditional operation UNC 
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Using Multiple Conditions 


Multiple conditions can be listed as operands of the conditional instructions. 
If multiple conditions are listed, all conditions must be met for the instruction 
to execute. Only certain combinations of conditions are acceptable (see 
Table 6-14). For each combination, the conditions must be selected from 
Group 1 or Group 2 as follows: 


(1 Group 1: You can select one condition from category A and one condi- 
tion from category B. The two conditions cannot be from the same catego- 
ry. For example, you can test EQ and OV at the same time but you cannot 
test GT and NEQ at the same time. The accumulator must be the same 
for both conditions; you cannot test conditions for both accumulators with 
the same instruction. For example, you can test AGT and AOV atthe same 
time, but you can not test AGT and BOV at the same time. 


[1 Group 2: Youcan select one condition from each of three categories (A, 
B, and C). No two conditions can be from the same category. For example, 
you can test TC, C, and BIO at the same time, but you cannot test NTC, 
C, and NC at the same time. 


Table 6-14. Grouping of Conditions for Multiconditional Instructions 


Group 1 Group 2 
‘CategoryA CategoryB —CategoryA CategoryB Category _ 
EQ OV TC Cc BIO 
NEQ NOV NTC NC NBIO 
LT 
LEQ 
GT 
GEQ 


6.6.2 Conditional Execute (XC) Instruction 


Where code branches conditionally over a 1- or 2-word code segment, you can 
replace the branch with a 1-cycle conditional execute instruction (XC). There 
are two forms for the XC instruction. One form is a conditional execute of a 
1-word instruction (XC 1, cond). The second form is a conditional execute of 
one 2-word instruction or two 1-word instructions (XC 2, cond). Conditions for 
XC are the same as the conditions for conditional branches, calls, and returns 
(see Table 6-13). 
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The condition must be stable two full cycles before the XC instruction is 
executed. This ensures that the decision is made before the instruction follow- 
ing XC is decoded. Avoid changing the XC condition in the two 1-word instruc- 
tions prior to XC. If no interrupts occur, these instructions have no effect on XC. 
However, if an interrupt occurs, it can trap between the instructions and XC, 
affecting the condition before XC is executed. See Chapter 7, Pipeline, for 
information about pipeline latencies. 


6.6.3 Conditional Store Instructions 


Some CPU registers can be conditionally stored in data memory using the 
conditional store instructions, listed in Table 6-15. The conditions used with 
conditional store instructions are listed in Table 6-16. 


In a conditional store instruction, the address is modified and the memory 
operand is read regardless of the condition. If the condition is met, the corre- 
sponding register is stored in data memory. If the condition is not met, the oper- 
and is written into the same memory location from which it was read, so, the 
value of that memory location remains the same. 


The conditional store instructions are single memory-operand instructions, but 
they use the dual memory-operand indirect addressing mode to put the 
instruction into one 16-bit word. Therefore, these instructions execute in one 
cycle. 


Conditionally storing the block-repeat counter (BRC) allows you to store an 
index in a repeat-block loop. 


Table 6-15. Conditional Store Instructions 


Instruction CPU Register 

SACCD Accumulator A or B 
STRCD Temporary register (T) 
SRCCD Block-repeat counter (BRC) 
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Table 6-16. Conditions for Conditional Store Instructions 


Operand 
AEQ 


BEQ 
ANEQ 
BNEQ 
ALT 
BLT 
ALEQ 
BLEQ 
AGT 
BGT 
AGEQ 
BGEQ 


Condition 


A=0 
B=0 
Az#0 
Bz#0 
A<0O 
B<0 
As0 
B<0 
A>0O 
B>0 
A220 
B20 


Description 

Accumulator A equal to 0 
Accumulator B equal to 0 
Accumulator A not equal to 0 
Accumulator B not equal to 0 
Accumulator A less than 0 
Accumulator B less than 0 
Accumulator A less than or equal to 0 
Accumulator B less than or equal to 0 
Accumulator A greater than 0 
Accumulator B greater than 0 
Accumulator A greater than or equal to 0 


Accumulator B greater than or equal to 0 
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6.7 Repeating a Single Instruction 


The ’54x includes two instructions, RPT and RPTZ, that cause the nextinstruc- 
tion to be repeated. The number of times for the instruction to be repeated is 
obtained from an operand of the instruction, and is equal to this operand + 1. 
This value is stored in the 16-bit repeat counter (RC) register. You cannot 
program the value in the RC register; it is loaded by the repeat instructions 
(RPT or RPTZ) only. The maximum number of executions of a given instruction 
is 65 536. An absolute program or data address is automatically incremented 
when the single-repeat feature is used. 


Once a repeat instruction is decoded, all interrupts, including NMI but not RS, 
are disabled until the completion of the repeat loop. However, the 54x does 
respond to the HOLD signal while executing an RPT/RPTZ loop—the 
response depends on the value of the HM bit of ST1. 


The repeat function can be used with some instructions, such as multiply/ 
accumulate and block moves, to increase the execution speed of these 
instructions. These multicycle instructions (see Table 6-17) effectively 
become single-cycle instructions after the first iteration of a repeat instruction. 


Table 6-17. Multicycle Instructions That Become Single-Cycle Instructions When 


Repeated 


Instruction Description # Cyclest 
FIRS Symmetrical FIR filter 3 
MACD Multiply and move result in accumulator with delay 3 
MACP Multiply and move result in accumulator 8} 
MVDK Data-to-data move 2 
MVDM Data-to-MMR move 2 
MVDP Data-to-program move 4 
MVKD Data-to-data move 2 
MVMD MMR-to-data move 2 
MVPD Program-to-data move 3 
READA Program-to-data move 5 
WRITA Data-to-program move 5 


t Number of cycles when instruction is not repeated 


Single data-memory operand instructions cannot be repeated if a long offset 
modifier or an absolute address is used (for example, *ARn(Ik), *+ARn(Ik), 
*+ARn(Ik)% and *(Ik)). Instructions listed in Table 6-18 cannot be repeated 
using RPT. 


Table 6-18. Nonrepeatable Instructions 


Instruction 


ADDM 
ANDM 
B[D] 
BACC[D] 
BANZ[D] 
BC[D] 
CALA[D] 
CALL[D] 
CC[D] 
CMPR 
DST 
FB[D] 
FBACC[D] 
FCALA[D] 
FCALL[D] 
FRET[D] 
FRETE[D] 
IDLE 
INTR 

LD ARP 
LD DP 
MVMM 
ORM 
RC[D] 
RESET 
RET[D] 
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Description 

Add long constant to data memory 

AND data memory with long constant 
Unconditional branch 

Branch to accumulator address 

Branch on auxiliary register not 0 

Conditional branch 

Call to accumulator address 

Unconditional call 

Conditional call 

Compare with auxiliary register 

Long word (32-bit) store 

Far branch unconditionally 

Far branch to location specified by accumulator 
Far call subroutine at location specified by accumulator 
Far call unconditionally 

Far return 

Enable interrupts and far return from interrupt 
Idle instructions 

Interrupt trap 

Load auxiliary register pointer (ARP) 

Load data page pointer (DP) 

Move memory-mapped register (MMR) to another MMR 
OR data memory with long constant 
Conditional return 

Software reset 


Unconditional return 
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Table 6-18. Nonrepeatable Instructions (Continued) 
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Instruction 
RETE[D] 
RETF[D] 
RND 
RPT 
RPTBI[D] 
RPTZ 
RSBX 
SSBX 
TRAP 
XC 
XORM 


Description 


Return from interrupt 

Fast return from interrupt 

Round accumulator 

Repeat next instruction 

Block repeat 

Repeat next instruction and clear accumulator 
Reset status register bit 

Set status register bit 

Software trap 

Conditional execute 


XOR data memory with long constant 
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6.8 Repeating a Block of Instructions 


The repeat-block instructions are used to repeat a block of code N + 1 times, 
where N is the value loaded into the block-repeat counter register (BRC). This 
block of code can contain one or more instructions. Unlike the repeat single 
operation, which disables all maskable interrupts, the repeat block operation 
can be interrupted. 


The instructions used for this operation are RPTB and RPTBD (a delayed 
instruction). The RPTB instruction executes in four cycles. RPTBD allows the 
execution of one 2-word instruction or two 1-word instructions following the 
RPTBD instruction instead of flushing the pipeline; thus, RPTBD effectively 
executes in 2 cycles. When the RPTBD instruction is used, delayed instruc- 
tions cannot be in the two words following the RPTBD instruction. 


The repeat block feature provides zero-overhead looping. Zero-overhead 
looping is controlled by the block-repeat active flag (BRAF) in ST1 and the 
following memory-mapped registers: 


[1 BRC contains the value N, which is one less than the number of times the 
block is to be repeated. 


[1 Theblock-repeat start address register (RSA) holds the address of the first 
instruction of the block of code to be repeated. 


_j] The block-repeat end address register (REA) holds the address of the last 
instruction word of the block of code to be repeated. 


BRAF is set to 1 to activate the block repeat. The block repeat feature can be 
activated only if the number of iterations is greater than 0. The following steps 
start a loop: 


Step 1: You load BRC with a loop count in the 0 through 65 535 range. 


Step 2: The instruction loads the address of the first instruction to be 
repeated. This instruction is the one immediately following RPTB of 
the second instruction following RPTBD. The repeat-block (RPTB) 
or repeat block with delay (RPTBD) instruction automatically loads 
RSA with the address of the instruction following the RPTB instruc- 
tion, or with the address of the second instruction following the 
RPTBD instruction. 
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Step 3: The instruction loads REA with the address following the last word 
of the last instruction to be repeated in the block, which is also the 
long-immediate operand given in the instruction. This action also 
sets BRAF. REA is loaded with the 16-bit-immediate operand of the 
RPTB or RPTBD instruction, and the BRAF bit is set. The value for 
the 16-bit-immediate operand of RPTB or RPTBD is L— 1, where L 
is the address of the instruction following the last word of the last 
instruction in the loop. 


Every time the PC is updated during loop execution, REA is compared to the 
PC value. If the values are equal, BRC is decremented. If BRC is greater than 
or equal to 0, RSA is loaded into the PC to restart the loop. If not, BRAF is reset 
to 0 and the processor resumes execution past the end of the loop. 


BRC is decremented during the instruction decode phase of the last repeat 
block instruction. For this reason, be careful when using the SRCCD instruc- 
tion within a loop. To save the current loop counter value (the predecremented 
BRC), the SRCCD instruction must be placed a minimum of three instructions 
before the end of the loop. 


There is only one set of block repeat registers, so multiple block repeats cannot 
be nested without saving the context of the outside loops. The simplest way 
of establishing nested loops is to use the RPTB[D] instruction for the innermost 
loop only, and use the BANZ[D] for all outer loops. 


6.9 Reset Operation 


Reset Operation 


Reset (RS) is a nonmaskable external interrupt that can be used at any time to 
place the 54x into a known state. For correct system operation after power-up, 
RS must be asserted (low) for several clock cycles to ensure that the data, ad- 
dress, and control lines are configured properly. Approximately five clock 
cycles after RS is de-asserted (goes high), the processor fetches the instruc- 
tion at FF80h and begins executing code. See section 10.5, Start-Ujp Access 
Sequences, on page10-24, for the reset sequence. 


The following actions occur during a reset operation: 


DOUUOUUUUUOUUCUUOU 


IPTR is set to 1FFh. 

RS is de-asserted. 

The MP/MC bit in PMST is set to the value of the MP/MC pin. 

PC is set to FF80h. 

XPC is cleared (548, ’549, 5402, 5410, and 5420). 

FF80h is driven on the address bus, regardless of the state of MP/MC. 
The data bus goes into the high-impedance state. 

The control lines are made inactive. 

The IACK signal is generated. 

INTM is set to 1 to disable all maskable interrupts. 

IFR is cleared to clear the interrupt flags. 

The single repeat counter (RC) is cleared. 

Asynchronized reset (SRESET) signal is sent to initialize the peripherals. 
The following status bits are set to their initial values: 


m ARP=0 m CLKOFF=0 @ HM=0 mM SXM=1 
m ASM=0 m CMPT=0 gm INTM=1 m TC=1 
gm AVIS=0 m CPL=0 gm OVA=0 mM XF=1 
m BRAF=0 mm DP=0 m OVB=0 

m C=1 m DROM=0 m OVLY=0 

m Ci6=0 m FRCT=0 m OVM=0 


NT 
Notes: 


1) 


3) 


The remaining status bits are not initialized—your code must initialize 
them appropriately. 


Reset does not initialize the stack pointer (SP). Your code must initialize it. 


If MP/MC = 0, the device begins executing code from the on-chip ROM. 
Otherwise, it begins executing code from off-chip memory. 


eee eee) 
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6.10 Interrupts 


Interrupts are hardware- or software-driven signals that cause the ’54x to 
suspend its main program and execute another function called an interrupt 
service routine (ISR). Typically, interrupts are generated by hardware devices 
that need to give data to or take data from the 54x (for example, ADCs, DACs, 
and other processors). Interrupts can also be used to signal that a particular 
event has taken place (for example, the timer is finished counting). 


The ’54x supports both software and hardware interrupts: 


( A software interrupt is requested by a program instruction (INTR, TRAP, 
or RESET). 


( A hardware interruptis requested by a signal from a physical device. Two 
types exist: 


m= External hardware interrupts are triggered by signals at external inter- 
rupt ports. 


Mm Internal hardware interrupts are triggered by signals from the on-chip 
peripherals. 


When multiple hardware interrupts are triggered at the same time, the 54x 
services them according to a set priority ranking in which 1 has the highest 
priority. To determine the priorities for the hardware interrupts, refer to the table 
for your particular 54x device in section 6.10.10, Interrupt Tables, on 
page 6-38. 


Each of the ’54x interrupts, whether hardware or software, can be placed in 
one of the following two categories: 


(1 Maskable interrupts. These are hardware or software interrupts that can 
be blocked (masked) or enabled (unmasked) using software. The '54x 
supports up to 16 user-maskable interrupts (SINT15—SINTO). Each 
device uses a subset of these 16 interrupts. For example, the ‘541 uses 
only nine of these interrupts (the others are tied high internally). Some of 
these have two names because they can be initiated by software or 
hardware; for the 541, the hardware names for these interrupts are: 


m INT3 through INTO 
m RINTO, XINTO, RINT1, and XINT1 (serial port interrupts) 
mw TINT (timer interrupt) 


(1 Nonmaskable interrupts. These interrupts cannot be blocked. The ’54x 
always acknowledges this type of interrupt and branches from the main 
program to an ISR. The 54x nonmaskable interrupts include all software 
interrupts and two external hardware interrupts: RS (reset) and NMI. (RS 
and NMI can also be asserted using software.) 


Interrupts 


RS is anonmaskable interrupt that affects all 54x operating modes. See 
section 6.9, Reset Operation, on page 6-25. NMI is a nonmaskable inter- 
rupt. Interrupts are globally disabled when NMI is asserted. NMI is differ- 
ent from RS because it does not affect any of the 54x modes. 


The ’54x handles interrupts in three phases: 


1) 


Receive interrupt request. Suspension of the main program is requested 
via software (program code) or hardware (a pin or an on-chip peripheral). 
If the interrupt source is requesting a maskable interrupt, the correspond- 
ing bit in the interrupt flag register (IFR) is set when the interrupt is 
received. 


Acknowledge interrupt. The ‘54x must acknowledge the interrupt request. 
If the interrupt is maskable, predetermined conditions must be metin order 
for the 54x to acknowledge it. For nonmaskable hardware interrupts and 
for software interrupts, acknowledgment is immediate. 


Execute interrupt service routine (ISR). Once the interrupt is acknowl- 
edged, the ‘54x executes the branch instruction you place at a predeter- 
mined address (the vector location) and performs the ISR. 


6.10.1 Interrupt Flag Register (IFR) 


IFR is a memory-mapped CPU register that identifies and clears active inter- 
rupts (see Figure 6—2). An interrupt sets its corresponding interrupt flag in IFR 
until itis recognized by the CPU. Any of the following four events clear an inter- 
rupt flag: 


L} 
L} 
L] 
Lj 


The ’54x is reset (RS is low). 

An interrupt trap is taken. 

A 1 is written to the appropriate bit in IFR. 

The INTR instruction is executed using the appropriate interrupt number. 


A 1 in any IFR bit indicates a pending interrupt. To clear an interrupt, write a 
1 to the interrupt’s corresponding bit in the IFR. All pending interrupts can be 
cleared by writing the current contents of the IFR back into the IFR. 
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Figure 6-2. Interrupt Flag Register (IFR) Diagram 


(a) 541 IFR 
15-12 11 
INT3 | XINT1 | RINT1 | XINTO | RINTO | TINT | INT2 | INT1 | INTO 
(b) 542 IFR 
15-12 11 
HPINT | INT3 | TXINT | TRINT |BXINTO [BRINTO | TINT | INT2 | INT1 | INTO 
(c) 543 IFR 
15-12 11 
INT3 | TXINT | TRINT |BXINTO [BRINTO | TINT | INT2 | INT1 | INTO 
(d) 545 IFR 
15-12 11 
HPINT | INT3 | XINT1 | RINT1 |BXINTO|BRINTO | TINT | INT2 | INT1 | INTO 
(e) 546 IFR 
15-12 11 
INT3 | XINT1 | RINT1 |BXINTO [BRINTO| TINT | INT2 | INT1 | INTO 
(f) 548 IFR 
15-12 11 
} Resvd | BXINT! BRINT1 }HPINT | INT3 | TXINT | TRINT |BXINTO |BRINTO | TINT | INT2 | INT1 | INTO 
(g) 549 IFR 
15-14 13 12 11 10 s 8 7 6 5 4 . 2 4 «@ 


| Resvd [BMINT! BMINTO | BXINT1 | BRINT1 |HPINT | INT3 | TXINT | TRINT | BXINTO | BRINTO | TINT | INT2 | INT1 | INTO 
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(h) 5402 IFR 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 


BXINT1 
or 
DMAC3 


Resvd DMAC5 | DMAC4 


(i) 5410 IFR 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 


BXINT1 
Resvd DMAC5|DMAC4| _ or or HPINT | INT3 
DMAC3 


(j) ‘5420 IFR 


is. 14 2 12 5 4 3 2 1 
BXINT2 | BRINT2 

Resvd | IPINT | DMAC5|DMAC4 HPINT | RSVD | _ or or _|BXINTO|BRINTO| TINT | RSVD | INT1 
DMAC1 | DMACO 

6.10.2 Interrupt Mask Register (IMR) 


Figure 6—3 shows how the ’54x uses a memory-mapped IMR for masking ex- 
ternal and internal interrupts. If INTM = 0 in ST1, a 1 in any IMR bit enables 
the corresponding interrupt. Neither NMI nor RS is included in the IMR, be- 
cause IMR has no effect on these interrupts. You can read or write to the IMR. 


Figure 6—3. Interrupt Mask Register (IMR) Diagram 
(a) ‘541 IMR 


15-12 


INT3 |XINT1 [RINT | XINTO | RINTO | TINT | INT2 | INT1 | INTO 


(b) ‘542 IMR 


15-12 


HPINT | INT3 | TXINT | TRINT |BXINTO |BRINTO | TINT | INT2 | INT1 | INTO 


(c) 543 IMR 


15-12 


INT3 | TXINT | TRINT |BXINTO |BRINTO| TINT | INT2 | INT1 | INTO 
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(d) 545 IMR 

15-12 

HPINT | INT3 | XINT1 | RINT1 |BXINTO [BRINTO | TINT | INT2 | INT1 | INTO 
(e) 546 IMR 

15-12 

INT3 | XINT1 | RINT1 |BXINTO [BRINTO| TINT | INT2 | INT1 | INTO 
(f) 548 IMR 

1512 11 

| Resvd | BXINT! BRINT1 |HPINT | INT3 | TXINT | TRINT |BXINTO |BRINTO | TINT | INT2 | INT1 | INTO 
(g) 549 IMR 

is4 15 12 11 10 9 8 7 6 5 4 3 2 1 «0 


} Resvd |BMINT! BMINTO | BXINT1 |BRINT1 |HPINT | INT3 | TXINT | TRINT | BXINTO | BRINTO | TINT | INT2 | INT1 | INTO 


(h) 5402 IMR 


15 14 13 12 11 10 9 8 f 6 5 4 3 2 1 0 


TINTA 
Resvd DMACS | DMAC4 INTS | 9 DMACO| BXINTO|BRINTO| TINTO | INT2 
1 


(i) 5410 IMR 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 


BXINT1 en BXINT2 | BRINT2 
Resvd DMACS5 | DMAC4 or or | BXINTO|}BRINTO} TINT | INT2 
DMAC3 pMAce DMAC1 | DMACO 


(i) 5420 IMR 


15 14 13 12 5 4 3 2 
BXINT1 | BRINT1 BXINT2 | BRINT2 

Resvd | IPINT |DMAC5|DMAC4| _ or or or or |BXINTO)BRINTO} TINT | RSVD 
DMAC3 | DMAC2 DMAC1 | DMACO 
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6.10.3 Phase 1: Receive Interrupt Request 


Interrupt 


An interrupt is requested by a hardware device or by a software instruction. 
When an interrupt request occurs, the corresponding flag (if any) is activated 
in the IFR (see section 6.10.1, /nterrupt Flag Register (IFR), on page 6-27). 
This flag is activated whether or not the interrupt is later acknowledged by the 
processor. The flag is automatically cleared when its corresponding interrupt 
is taken. 


Lj Hardware interrupt requests. External hardware interrupts are requested 
by signals at external interrupt ports, and internal hardware interrupts are 
requested by signals from the on-chip peripherals. For example, on the 
541, hardware interrupts can be requested by or through: 


Pins INT3 through INTO 

Pins RS (reset) and NMI 

The serial ports interrupts (RINTO and XINTO or RINT1 and XINT1) 
The timer interrupt (TINT) 


Table 6—19 through Table 6-24 (pages 6-38 through 6-43) list the interrupt 
sources for each ’54x device. 


(1 Software interrupt requests. A software interrupt is requested by one of the 
following program instructions: 


INTR. This instruction allows you to execute any interrupt service rou- 
tine. The instruction operand (K) indicates which interrupt vector loca- 
tion the CPU branches to. Table 6—19 through Table 6—24 (pages 
6-38 through 6-43) show the operand K used to refer to each vector 
location. When an INTR interrupt is acknowledged, the interrupt mode 
bit (INTM) in ST1 is set to 1 to disable maskable interrupts. 


TRAP. This instruction performs the same function as the INTR 
instruction without setting the INTM bit. Table 6-19 through 
Table 6-24 (pages 6-38 through 6-43) show the operand K used to 
refer to each vector location. 


RESET. This instruction performs a nonmaskable software reset that 
can be used any time to put the 54x into a known state. The RESET 
instruction affects STO and ST1, but does not affect PMST. For a 
summary of the registers and bits affected, see the description of the 
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RESET instruction in TMS320C54x DSP Reference Set, Volume 2: 
Mnemonic Instruction Set or Volume 3: Algebraic Instruction Set. 
When the RESET instruction is acknowledged, INTM is set to 1 to dis- 
able maskable interrupts. The initialization of IPTR and the peripheral 
registers is different from the initialization done by a hardware reset 
(see section 6.9, Reset Operation, on page 6-25). 


6.10.4 Phase 2: Acknowledge Interrupt 


Interrupt 


After an interrupt has been requested by hardware or software, the CPU must 
decide whether to acknowledge the request. Software interrupts and non- 
maskable hardware interrupts are acknowledged immediately. Maskable 
hardware interrupts are acknowledged only after certain conditions are met: 


(1 Priority is highest. When more than one hardware interrupt is requested 
at the same time, the ’54x services them according to a set priority ranking 
in which 1 indicates the highest priority. Table 6-19 through Table 6-24 
show the priorities for the hardware interrupts. 


Ly INTM bit is 0. The interrupt mode bit (INTM), which is in ST1, enables or 
disables all maskable interrupts: 


m@ When INTM =0, all unmasked interrupts are enabled. 
m@ When INTM = 1, all unmasked interrupts are disabled. 


INTM is set to 1 automatically when an interrupt is taken. If the program 
exits the interrupt service routine (ISR) using the RETE instruction (return 
from interrupt with automatic reenable), INTM is reenabled (cleared). 
INTM can also be set with a hardware reset (RS) or by executing an SSBX 
INTM instruction (disable interrupt). INTM is reset by executing the RSBX 
INTM instruction (enable interrupt). INTM does not actually modify IMR or 
IFR. 


(1 IMR mask bit is 1. Each of the maskable interrupts has its own mask bit 
in the IMR. To enable an interrupt, set its mask bit to 1. See section 6.10.2, 
Interrupt Mask Register (IMR), on page 6-29. 


The CPU acknowledges a maskable hardware interrupt, it jams the instruction 
bus with the INTR instruction. This instruction forces PC to the appropriate 
address and fetches the software vector. As the CPU fetches the first word of 
the software vector, it generates the [ACK signal, which clears the appropriate 
interrupt flag bit. 
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For enabled interrupts, when IACK occurs, the interrupt number is indicated 
by address bits A6—A2 on the rising edge of CLKOUT. If the interrupt vectors 
reside in on-chip memory and you want to observe the addresses, the ’54x 
must operate in address visibility mode (AVIS = 1) so that the interrupt number 
can be decoded. If an interrupt occurs while the ’54x is on hold and HM = 0, 
the address cannot be present when IACK becomes active. 


6.10.5 Phase 3: Execute Interrupt Service Routine (ISR) 


Interrupt 


After acknowledging the interrupt, the CPU: 


1) Stores the program counter (PC) value (the return address) to the top of 
the stack in data memory 


eT A ASAE, | 
Note: 


The program counter extension register, XPC, does not get pushed to the top 
of the stack; that is, it does not get saved on the stack. Therefore, if an ISR 
is located on a different page from the vector table, you must push the XPC 
on the stack prior to branching to the ISR. A FRET [E] can be used to return 
from the ISR. 


|) 


2) Loads the PC with the address of the interrupt vector 


3) Fetches the instruction located at the vector address. (If the branch is 
delayed and you also stored one 2-word instruction or two 1-word instruc- 
tions, the CPU also fetches these words.) 


4) Executes the branch, which leads it to the address of your ISR. (If the 
branch is delayed, the additional instruction(s) are executed before the 
branch.) 


5) Executes the ISR until a return instruction concludes the ISR 


6) Follows the stack pointer (SP) to the top of the stack, and pops the return 
address off the stack and into PC 


7) Continues executing the main program 


To determine which vector address has been assigned to each of the inter- 
rupts, refer to the table for your specific 54x device in section 6.10.10, /nterrupt 
Tables, on page 6-38. Interrupt addresses are spaced four locations apart so 
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that a delayed branch instruction and two 1-word instructions or one 2-word 
instruction can be accommodated in those locations. 


6.10.6 Interrupt Context Save 


When an interrupt service routine is executed, certain registers must be saved 
onto the stack. When the program returns from the ISR (by an RC[D], 
RETE[D], or RETF[D]), your software code must restore the contents of these 
registers. You can manage stack storage as long as the stack does not exceed 
the memory space. This stack is also used for subroutine calls; the 54x sup- 
ports subroutine calls within the ISR. Because the CPU registers and peripher- 
al registers are memory-mapped, the PSHM and POPM instructions can 
transfer these registers to and from the stack. In addition, the PSHD and POPD 
instructions can transfer data-memory values to and from the stack. 


There are a number of special considerations that you must follow when doing 
context saves and restores. The first consideration is that when you use the 
stack to save the context you must perform the restore in the exact reverse 
order. The second consideration is that BRC should be restored prior to restor- 
ing the BRAF bit in ST1. If you fail to follow this order, the BRAF bit will be 
cleared, if BRC = 0 before BRC is restored. 


6.10.7 Interrupt Latency 


The 54x completes all instructions in the pipeline except the instructions in the 
prefetch and fetch stages before executing an interrupt, so the maximum inter- 
rupt latency depends on the contents of the pipeline. See section 7.2, /nter- 
rupts and the Pipeline, on page 7-25 for more information about pipeline laten- 
cies associated with interrupts. Instructions that are extended by wait states 
for slower-memory access and repeated instructions require extra time to 
process an interrupt. 


The single-repeat instructions (RPT and RPTZ) require that all executions of 
the next instruction be completed before allowing an interrupt to execute to 
protect the context of the repeated instructions. This protection is necessary, 
because these instructions run parallel operations in the pipeline, and the con- 
text of these operations cannot be saved in the ISR. 


Since the hold function takes precedence over interrupts, it can also delay an 
interrupt trap. If an interrupt occurs when the CPU is on hold (HOLD is 
asserted) and the interrupt vector must be fetched from external memory, the 
interrupt is not taken until HOLDA is de-asserted (after the hold state ends). 
However, if the processor is in the concurrent hold mode (HM = 0) and the in- 
terrupt vector table is located in internal memory, the CPU takes the interrupt, 
regardless of HOLD. 
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Interrupts cannot be processed between the RSBX INTM instruction and the 
next instruction in a program sequence. If an interrupt occurs during the de- 
code phase of RSBX INTM, the CPU always completes RSBX INTM as well 
as the following instruction before the pending interrupt is processed. Waiting 
for these instructions to complete ensures that a return (RET) can be executed 
in an ISR before the next interrupt is processed to protect against stack over- 
flow. If an ISR ends with an RETE instruction (return from ISR with enable), the 
RSBX INTM instruction is unnecessary. Similar to an RSBX INTM instruction, 
an SSBX INTM instruction and the instruction that follows it cannot be inter- 
rupted. 


ls ns seen ss SS ACA: | 
Note: 


Reset (RS) is not delayed by multicycle instructions. NMI can be delayed by 
multicycle instructions and by HOLD. 


6.10.8 Interrupt Operation: A Quick Summary 


Once an interrupt has been passed to the CPU, the CPU operates in the follow- 
ing manner (see Figure 6—5 on page 6-37): 


(1 +If a maskable interrupt is requested: 
1) The corresponding bit in the IFR is set. 


2) The acknowledgment conditions (INTM =0 and IMR bit =1) are tested. 
If the conditions are true, the CPU acknowledges the interrupt, gener- 
ating an IACK signal; otherwise, it ignores the interrupt and continues 
with the main program. 


3) When the interrupt has been acknowledged, its flag bit in the IFR is 
cleared to 0 and the INTM bit is set to 1 (to block other maskable inter- 
rupts). 


4) The PC is saved on the stack. 
5) The CPUbranches to and executes the interrupt service routine (ISR). 


6) The ISR is concluded by a return instruction, which pops the return 
address off the stack. 


7) The CPU continues with the main program. 


1] If anonmaskable interrupt is requested: 


1) The CPU immediately acknowledges the interrupt, generating an 
IACK signal. 


2) Ifthe interrupt was requested by RS, NMI, or the INTR instruction, the 
the INTM bit is set to 1 to block maskable hardware interrupts. 
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3) Ifthe INTR instruction has requested one of the maskable interrupts, 
the corresponding flag bit is cleared to 0. 


4) The PC is saved on the stack. 
5) The CPU branches to and executes the ISR. 


6) The ISR is concluded by a return instruction, which pops the return 
address of the stack. 


7) The CPU continues with the main program. 


Te 


Note: 


The INTR instruction disables maskable interrupts by setting the interrupt 
mode bit (INTM), but the TRAP instruction does not affect INTM. 


| 


6.10.9 Re-mapping Interrupt-Vector Addresses 


The interrupt vectors can be remapped to the beginning of any 128-word page 
in program memory except in reserved areas. The interrupt-vector address is 
generated by concatenating the interrupt-pointer (IPTR) field of PMST with the 
interrupt-vector number (0-31) shifted by 2. Consider the example of 
Figure 6—4: if INTO is asserted low and IPTR = 0001h, the interrupt vector is 
fetched from OOCOh. The interrupt-vector number for INTO is 16 or 10h. 


Figure 6-4. Interrupt-Vector Address Generation 


Vector} 0 0 O O 0 00 0 1; 1 0 0 0 0 0 
Bit] 15 14 13 12 1110 9 8 7/6 5 4 3 2 1 


Address 0 0 Cc 0 


IPTR = 0 0000 0001 INT = 40h (INTO) 


/ ee e \ 


o|;|oO 


At reset, the IPTR bits are set to 1 (IPTR = 1FFh); this value maps the vectors 
to page 511 in program-memory space. Therefore, the reset vector for hard- 
ware resets always resides at location OFF80h. The interrupt vectors can be 
mapped to another location by loading IPTR with a value other than 1FFh. For 
example, the interrupt vectors can be moved to start at location 0080h by load- 
ing IPTR with 000th. 


a | 


Note: 


The hardware reset (RS) vector cannot be remapped because the hardware 
reset loads the IPTR with 1s. Therefore, the reset vector for hardware resets 


is always fetched at location FF80h in program space. 
a | 
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Figure 6—5. Flow Diagram of Interrupt Operation 


Interrupt request received 


Interrupt 
maskable? 


Interrupt acknowledged; 
IACK generated 


Hardware interrupt 
or 
INTR instruction? 


N 


oO 
W 
PC saved on software stack ; 
W 
Interrupt service routine run 
WV 
Return instruction restores PC 
ne | 


W 
Main program continues 


Program Memory Addressing 6-37 


Interrupts 


6.10.10 Interrupt Tables 


Table 6-19 through Table 6-24 show the interrupt trap number, priority, and 
location for each 54x device. 


Table 6-19. °541 Interrupt Locations and Priorities 


TRAP/INTR Location 


Number (K) Priority Name (Hex) Function 

0 1 RS/SINTR 0 Reset (hardware and software reset) 
1 2 NMI/SINT16 4 Nonmaskable interrupt 

2 - SINT17 8 Software interrupt #17 

3 - SINT18 C Software interrupt #18 

4 - SINT19 10 Software interrupt #19 

5 - SINT20 14 Software interrupt #20 

6 = SINT21 18 Software interrupt #21 

7 - SINT22 1C Software interrupt #22 

8 - SINT23 20 Software interrupt #23 

9 - SINT24 24 Software interrupt #24 

10 - SINT25 28 Software interrupt #25 

11 - SINT26 2C Software interrupt #26 

12 - SINT27 30 Software interrupt #27 

13 - SINT28 34 Software interrupt #28 

14 - SINT29 38 Software interrupt #29; reserved 
15 - SINT30 3C Software interrupt #30; reserved 
16 8 INTO/SINTO 40 External user interrupt #0 

17 4 INT1/SINT1 44 External user interrupt #1 

18 5 INT2/SINT2 48 External user interrupt #2 

19 6 TINT/SINT3 4C Internal timer interrupt 
20 v RINTO/SINT4 50 Serial port 0 receive interrupt 
21 8 XINTO/SINT5 54 Serial port 0 transmit interrupt 
22 9 RINT1/SINT6 58 Serial port 1 receive interrupt 
23 10 XINT1/SINT7 5C Serial port 1 transmit interrupt 
24 11 INT3/SINT8 60 External user interrupt #3 

25-31 - 64—7F Reserved 


Table 6-20. °542 Interrupt Locations and Priorities 


TRAP/INTR 
Number (K) 


0 
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Priority 
1 
2 
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Name 
RS/SINTR 


NMI/SINT16 
SINT17 
SINT18 
SINT19 
SINT20 
SINT21 
SINT22 
SINT23 
SINT24 
SINT25 
SINT26 
SINT27 
SINT28 
SINT29 
SINT30 
INTO/SINTO 
INTT/SINT1 
INT2/SINT2 
TINT/SINT3 
BRINTO/SINT4 
BXINTO/SINT5 
TRINT/SINT6 
TXINT/SINT7 
INT3/SINT8 
HPINT/SINT9 


Location 


(Hex) 
0 
4 
8 
Cc 
10 
14 
18 
1C 
20 
24 
28 
2c 
30 
34 
38 
3C 
40 
44 
48 
4C 
50 
54 
58 
5C 
60 
64 
68-7F 
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Function 


Reset (hardware and software reset) 
Nonmaskable interrupt 

Software interrupt #17 

Software interrupt #18 

Software interrupt #19 

Software interrupt #20 

Software interrupt #21 

Software interrupt #22 

Software interrupt #23 

Software interrupt #24 

Software interrupt #25 

Software interrupt #26 

Software interrupt #27 

Software interrupt #28 

Software interrupt #29, reserved 
Software interrupt #30, reserved 
External user interrupt #0 

External user interrupt #1 

External user interrupt #2 

Internal timer interrupt 

Buffered serial port receive interrupt 
Buffered serial port transmit interrupt 
TDM serial port receive interrupt 
TDM serial port transmit interrupt 
External user interrupt #3 

HPI interrupt 


Reserved 
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Table 6-21. 543 Interrupt Locations and Priorities 


TRAP/INTR Location 
Number (K) Priority Name (Hex) Function 

0 1 RS/SINTR 0 Reset (hardware and software reset) 
1 2 NMI/SINT16 4 Nonmaskable interrupt 

2 - SINT17 8 Software interrupt #17 

3 - SINT18 C Software interrupt #18 

4 - SINT19 10 Software interrupt #19 

5 - SINT20 14 Software interrupt #20 

6 = SINT21 18 Software interrupt #21 

7 - SINT22 1C Software interrupt #22 

8 - SINT23 20 Software interrupt #23 

9 - SINT24 24 Software interrupt #24 

10 - SINT25 28 Software interrupt #25 

11 - SINT26 2C Software interrupt #26 

12 - SINT27 30 Software interrupt #27 

13 - SINT28 34 Software interrupt #28 

14 - SINT29 38 Software interrupt #29, reserved 

15 - SINT30 3C Software interrupt #30, reserved 

16 3 INTO/SINTO 40 External user interrupt #0 

17 4 INT1/SINT1 44 External user interrupt #1 

18 5 INT2/SINT2 48 External user interrupt #2 

19 6 TINT/SINT3 4C Internal timer interrupt 
20 7 BRINTO/SINT4 50 Buffered serial port receive interrupt 
21 8 BXINTO/SINT5 54 Buffered serial port transmit interrupt 
22 9 TRINT/SINT6 58 TDM serial port receive interrupt 
23 10 TXINT/SINT7 5C TDM serial port transmit interrupt 
24 aim INT3/SINT8 60 External user interrupt #3 

25-31 - 64—-7F Reserved 
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Table 6-22. °545 Interrupt Locations and Priorities 


TRAP/INTR 
Number (K) 


0 
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Priority 
1 
2 
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Name 
RS/SINTR 


NMI/SINT16 
SINT17 
SINT18 
SINT19 
SINT20 
SINT21 
SINT22 
SINT23 
SINT24 
SINT25 
SINT26 
SINT27 
SINT28 
SINT29 
SINT30 
INTO/SINTO 
INTT/SINT1 
INT2/SINT2 
TINT/SINT3 
BRINTO/SINT4 
BXINTO/SINT5 
RINT1/SINT6 
XINT1/SINT7 
INT3/SINT8 
HPINT/SINT9 


Location 


(Hex) 
0 
4 
8 
Cc 
10 
14 
18 
1C 
20 
24 
28 
2C 
30 
34 
38 
3C 
40 
44 
48 
4C 
50 
54 
58 
5C 
60 
64 
68-7F 
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Function 


Reset (hardware and software reset) 
Nonmaskable interrupt 

Software interrupt #17 

Software interrupt #18 

Software interrupt #19 

Software interrupt #20 

Software interrupt #21 

Software interrupt #22 

Software interrupt #23 

Software interrupt #24 

Software interrupt #25 

Software interrupt #26 

Software interrupt #27 

Software interrupt #28 

Software interrupt #29, reserved 
Software interrupt #30, reserved 
External user interrupt #0 

External user interrupt #1 

External user interrupt #2 

Internal timer interrupt 

Buffered serial port receive interrupt 
Buffered serial port transmit interrupt 
Serial port receive interrupt 

Serial port transmit interrupt 
External user interrupt #3 

HPI interrupt 


Reserved 
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Table 6-23. ‘546 Interrupt Locations and Priorities 


TRAP/INTR Location 
Number (K) Priority Name (Hex) Function 

0 1 RS/SINTR 0 Reset (hardware and software reset) 
1 2 NMI/SINT16 4 Nonmaskable interrupt 

2 - SINT17 8 Software interrupt #17 

3 - SINT18 C Software interrupt #18 

4 - SINT19 10 Software interrupt #19 

5 - SINT20 14 Software interrupt #20 

6 = SINT21 18 Software interrupt #21 

7 - SINT22 1C Software interrupt #22 

8 - SINT23 20 Software interrupt #23 

9 - SINT24 24 Software interrupt #24 

10 - SINT25 28 Software interrupt #25 

11 - SINT26 2C Software interrupt #26 

12 - SINT27 30 Software interrupt #27 

13 - SINT28 34 Software interrupt #28 

14 - SINT29 38 Software interrupt #29, reserved 

15 - SINT30 3C Software interrupt #30, reserved 

16 3 INTO/SINTO 40 External user interrupt #0 

17 4 INT1/SINT1 44 External user interrupt #1 

18 5 INT2/SINT2 48 External user interrupt #2 

19 6 TINT/SINT3 4C Internal timer interrupt 
20 7 BRINTO/SINT4 50 Buffered serial port receive interrupt 
21 8 BXINTO/SINT5 54 Buffered serial port transmit interrupt 
22 9 RINT1/SINT6 58 Serial port receive interrupt 
23 10 XINT1/SINT7 5C Serial port transmit interrupt 
24 aim INT3/SINT8 60 External user interrupt #3 

25-31 - 64—-7F Reserved 


6-42 


Table 6-24. 548 Interrupt Locations and Priorities 


TRAP/INTR 
Number (K) 


0 
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Priority 
1 
2 
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Name 
RS/SINTR 
NMI/SINT16 
SINT17 
SINT18 
SINT19 
SINT20 
SINT21 
SINT22 
SINT23 
SINT24 
SINT25 
SINT26 
SINT27 
SINT28 
SINT29 
SINT30 
INTO/SINTO 
INT1/SINT1 
INT2/SINT2 
TINT/SINT3 
BRINTO/SINT4 
BXINTO/SINT5 
TRINT/SINT6 
TXINT/SINT7 
INT3/SINT8 
HPINT/SINT9 
BRINT1/SINT10 
BXINT1/SINT11 


Location 


(Hex) 
0 
4 
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Function 

Reset (hardware and software reset) 
Nonmaskable interrupt 

Software interrupt #17 

Software interrupt #18 

Software interrupt #19 

Software interrupt #20 

Software interrupt #21 

Software interrupt #22 

Software interrupt #23 

Software interrupt #24 

Software interrupt #25 

Software interrupt #26 

Software interrupt #27 

Software interrupt #28 

Software interrupt #29, reserved 
Software interrupt #30, reserved 
External user interrupt #0 

External user interrupt #1 

External user interrupt #2 

Internal timer interrupt 

Buffered serial port 0 receive interrupt 
Buffered serial port 0 transmit interrupt 
TDM serial port receive interrupt 
TDM serial port transmit interrupt 
External user interrupt #3 

HPI interrupt 

Buffered serial port 1 receive interrupt 
Buffered serial port 1 transmit interrupt 


Reserved 
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Table 6-25. °549 Interrupt Locations and Priorities 


TRAP/INTR Location 
Number (K) Priority Name (Hex) Function 

0 1 RS/SINTR 0 Reset (hardware and software reset) 
1 2 NMI/SINT16 4 Nonmaskable interrupt 

2 - SINT17 8 Software interrupt #17 

3 - SINT18 Cc Software interrupt #18 

4 - SINT19 10 Software interrupt #19 

5 - SINT20 14 Software interrupt #20 

6 = SINT21 18 Software interrupt #21 

7 - SINT22 1C Software interrupt #22 

8 - SINT23 20 Software interrupt #23 

9 = SINT24 24 Software interrupt #24 

10 - SINT25 28 Software interrupt #25 

11 - SINT26 2C Software interrupt #26 

12 - SINT27 30 Software interrupt #27 

13 - SINT28 34 Software interrupt #28 

14 - SINT29 38 Software interrupt #29 

15 - SINT30 3C Software interrupt #30 

16 $} INTO/SINTO 40 External user interrupt #0 

17 4 INT1/SINT1 44 External user interrupt #1 

18 5 INT2/SINT2 48 External user interrupt #2 

19 6 TINT/SINT3 4C Internal timer interrupt 
20 iu BRINTO/SINT4 50 Buffered serial port 0 receive interrupt 
21 8 BXINTO/SINT5 54 Buffered serial port 0 transmit interrupt 
22 9 TRINT/SINT6 58 TDM serial port receive interrupt 
23 10 TXINT/SINT7 5C TDM serial port transmit interrupt 
24 11 INT3/SINT8 60 External user interrupt #3 
25 12 HINT/SINT9 64 HPI interrupt 
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Table 6-25. °549 Interrupt Locations and Priorities (Continued) 


TRAP/INTR 
Number (K) 


26 
27 
28 


29 


30-31 


Table 6-26. 


TRAP/INTR 
Number (K) 


0 
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Priority 
13 
14 
15 


Name 
BRINT1/SINT10 


BXINT1/SINT11 
BMINTO/SINT12 


BMINT1/SINT13 


Location 
(Hex) 


68 
6C 
70 


74 


78-7F 


5402 Interrupt Locations and Priorities 


Priority 
1 
2 


Name 
RS/SINTR 


NMI/SINT16 
SINT17 
SINT18 
SINT19 
SINT20 
SINT21 
SINT22 
SINT23 
SINT24 
SINT25 
SINT26 
SINT27 
SINT28 
SINT29 
SINT30 


Location 
(Hex) 


0 
4 
8 
C 
10 
14 
18 
1C 
20 
24 
28 
2c 
30 
34 
38 
3C 


Function 


Buffered serial port 1 receive interrupt 
Buffered serial port 1 transmit interrupt 


BSP #0 misalignment detection 
interrupt (549 only) 


BSP #1 misalignment detection 
interrupt (549 only) 


Reserved 


Function 


Reset (hardware and software reset) 
Nonmaskable interrupt 
Software interrupt #17 
Software interrupt #18 
Software interrupt #19 
Software interrupt #20 
Software interrupt #21 
Software interrupt #22 
Software interrupt #23 
Software interrupt #24 
Software interrupt #25 
Software interrupt #26 
Software interrupt #27 
Software interrupt #28 
Software interrupt #29 


Software interrupt #30 


Program Memory Addressing 6-45 


Interrupts 


Table 6-26. °5402 Interrupt Locations and Priorities (Continued) 


TRAP/INTR 
Number (K) 


16 
17 
18 
19 
20 
21 
22 
23 


24 
25 
26 


27 


28 
29 
120-127 


Priority 


15 
16 


Name 
INTO/SINTO 


INT1/SINT1 
INT2/SINT2 
TINTO/SINT3 
BRINTO/SINT4 
BXINTO/SINT5 
DMACO/SINT7 


TINT1/DMAC1/ 
SINT7 


INT3/SINT8 
HPINT/SINT9 


BRINT1/DMAC2/ 
SINT10 


BXINT1/DMAC3/ 
SINT11 


DMAC4/SINT12 
DMAC5/SINT13 


Reserved 


Location 
(Hex) 


40 
44 
48 
4C 
50 
54 
58 
5C 


60 
64 
68 


6C 


70 
74 
78-7F 


Table 6-27. ‘5410 Interrupt Locations and Priorities 


TRAP/INTR 
Number (K) 


- WO ND 


Priority 
1 
2 


Name 
RS/SINTR 


NMI/SINT16 
SINT17 
SINT18 
SINT19 


Location 
(Hex) 


0 
4 
8 
C 
10 


Function 


External user interrupt #0 
External user interrupt #1 
External user interrupt #2 
TimerO interrupt 

McBSP #0 receive interrupt 
McBSP #0 transmit interrupt 
DMA channel 0 interrupt 


Timer interrupt (default) or DMA 
channel 1 interrupt. 


External user interrupt #3 
HPI interrupt 


McBSP #1 receive interrupt (default) 
or DMA channel 2 interrupt 


McBSP #1 transmit interrupt (default) 
or DMA channel 3 interrupt 


DMA channel 4 interrupt 
DMA channel 5 interrupt 


Reserved 


Function 


Reset (hardware and software reset) 
Nonmaskable interrupt 
Software interrupt #17 
Software interrupt #18 


Software interrupt #19 
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Table 6-27. °5410 Interrupt Locations and Priorities (Continued) 


TRAP/INTR 
Number (K) 


5 
6 
7 
8 
9 


10 


Priority Name 


SINT20 
SINT21 
SINT22 
SINT23 
SINT24 
SINT25 
SINT26 
SINT27 
SINT28 
SINT29 
SINT30 
INTO/SINTO 
INTT/SINT1 
INT2/SINT2 
TINT/SINT3 
BRINTO/SINT4 
BXINTO/SINT5 


BRINT2/DMACO 


BXINT2/DMAC1 


INT3/SINT8 
HPINT/SINT9 


BRINT1/DMAC2/ 
SINT10 


BXINT1/DMAC3/ 
SINT11 


Location 
(Hex) 


14 
18 
1C 
20 
24 
28 
2C 
30 
34 
38 
3C 
40 
44 
48 
4C 
50 
54 


58 


5C 


60 
64 


68 


6C 


Function 


Software interrupt #20 

Software interrupt #21 

Software interrupt #22 

Software interrupt #23 

Software interrupt #24 

Software interrupt #25 

Software interrupt #26 

Software interrupt #27 

Software interrupt #28 

Software interrupt #29 

Software interrupt #30 

External user interrupt #0 

External user interrupt #1 

External user interrupt #2 

TimerO interrupt 

McBSP #0 receive interrupt (default) 
McBSP #0 transmit interrupt (default) 


McBSP #2 receive interrupt (default) 
or DMA channel 0 interrupt 


MCBSP #2 transmit interrupt (default) 
or DMA channel 1 interrupt. 


External user interrupt #3 
HPI interrupt 


McBSP #1 receive interrupt (default) 
or DMA channel 2 interrupt. 


McBSP #1 transmit interrupt (default) 
or DMA channel 3 interrupt. 
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Table 6-27. °5410 Interrupt Locations and Priorities (Continued) 


TRAP/INTR 
Number (K) 


28 
29 
120-127 


Priority 
15 
16 


Name 
DMAC4/SINT12 


DMAC5/SINT13 


Reserved 


Location 
(Hex) 


70 
74 


Table 6-28. °5420 Interrupt Locations and Priorities 


TRAP/INTR 
Number (K) 


0 


oOo Oa N OD a FF WO PE = 


- ar ina... lr aS... lUrrae 
oOo N Oo a fF WO PO + OO 


Priority 
1 
2 


Name 
RS/SINTR 


NMI/SINT16 
SINT17 
SINT18 
SINT19 
SINT20 
SINT21 
SINT22 
SINT23 
SINT24 
SINT25 
SINT26 
SINT27 
SINT28 
SINT29 
SINT30 
INTO/SINTO 
INT1/SINT1 
INT2/SINT2 


Location 


(Hex) 
0 
4 
8 
Cc 
10 
14 
18 
1 
20 
24 
28 
2c 
30 
34 
38 
3C 
40 
44 
48 


Function 


DMA channel 4 interrupt 
DMA channel 5 interrupt 


Reserved 


Function 


Reset (hardware and software reset) 
Nonmaskable interrupt 
Software interrupt #17 
Software interrupt #18 
Software interrupt #19 
Software interrupt #20 
Software interrupt #21 
Software interrupt #22 
Software interrupt #23 
Software interrupt #24 
Software interrupt #25 
Software interrupt #26 
Software interrupt #27 
Software interrupt #28 
Software interrupt #29 
Software interrupt #30 
External user interrupt #0 
External user interrupt #1 


Reserved 
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Table 6-28. 5420 Interrupt Locations and Priorities (Continued) 


TRAP/INTR 
Number (K) 


19 
20 
21 
22 


23 


24 
25 
26 


27 


28 

29 

30 
124-127 


Priority Name 


6 
iT 
8 
9 
10 


il 
12 
13 


14 


15 
16 
17 


TINT/SINT3 
BRINTO/SINT4 
BXINTO/SINT5 
BRINT2/DMACO 


BXINT2/DMAC1 


INT3/SINT8 
HPINT/SINT9 
BRINT1/DMAC2 


BXINT1/DMAC3 


DMAC4/SINT12 
DMAC5/SINT13 
IPINT/SINT14 


Location 
(Hex) 


4C 
50 
54 
58 


5C 


60 
64 
68 


6C 


70 

74 

78 
7C-7F 


Function 


External timer 
McBSP #0 receive interrupt (default) 
McBSP #0 transmit interrupt (default) 


McBSP #2 receive interrupt (default) 
or DMA channel 0 interrupt 


MCBSP #2 transmit interrupt (default) 
or DMA channel 1 interrupt. 


Reserved 
HPI interrupt (from DSPINT in HPIC) 


McBSP #1 receive interrupt (default) 
or DMA channel 2 interrupt. 


McBSP #1 transmit interrupt (default) 
or DMA channel 3 interrupt. 


DMA channel 4 interrupt 
DMA channel 5 interrupt 
Interprocessor interrupt 


Reserved 
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6.11 Power-Down Modes 


The ’54x has power-down modes in which it enters a dormant state and dissi- 
pates less power than normal operation while maintaining the CPU contents. 
This allows operations to continue unaltered when the power-down mode is 
terminated. 


You can invoke one of the power-down modes either by executing the IDLE 1, 
IDLE 2 or IDLE 3 instructions, or by driving the HOLD signal low with the HM 
status bit set to 1. Power-down operation is summarized in Table 6-29 and 
described in detail in sections 6.11.1 through 6.11.5. 


Table 6-29. Operation During the Four Power-Down Modes 


Operation/Feature IDLE1 IDLE2 IDLE3 HOLD 
CPU halted Yes Yes Yes Yest 
CPU clock stopped Yes Yes Yes No 
Peripheral clock stopped No Yes Yes No 
Phase-locked loop (PLL) stopped No No Yes No 
External address lines put in high-impedance state No No No Yes 
External data lines put in high-impedance state No No No Yes 
External control signals put in high-impedance state No No No Yes 
Power-down terminated by: 
HOLD driven high No No No Yes 
Unmasked internal hardware interrupts Yes No No No 
Unmasked external hardware interrupts Yes Yes Yes No 
NMI Yes Yes Yes No 
RS Yes Yes Yes No 


t Depending on the state of the HM bit, the CPU continues to execute unless the execution requires an external memory access. 


6.11.1 IDLE1 Mode 


The IDLE1 mode halts all CPU activities except the system clock. Because the 
system clock remains applied to the peripheral modules, the peripheral circuits 
continue operating and the CLKOUT pin remains active. Thus, peripherals 
such as serial ports and timers can take the CPU out of its power-down state. 


Use the IDLE 1 instruction to enter the IDLE1 mode. To terminate IDLE1, use 
a wake-up interrupt. If INTM = 0 when the wake-up interrupt takes place, the 


6.11.2 IDLE2 Mode 


6.11.3 IDLE3 Mode 


Power-Down Modes 


54x enters the ISR when IDLE1 is terminated. If INTM = 1, the 54x continues 
with the instruction following the IDLE 1 instruction. All wake-up interrupts 
must set to enable the corresponding bits in the IMR register regardless of the 
INTM value. The only exceptions are the nonmaskable interrupts, RS and 
NMI. 


The IDLE2 mode halts the on-chip peripherals as well as the CPU. Because 
the on-chip peripherals are stopped in this mode, they cannot be used to 
generate the interrupt to wake up the 54x as with IDLE1. However, power is 
significantly reduced because the device is completely stopped. 


Use the IDLE 2 instruction to enter the IDLE2 mode. To terminate IDLE2, 
activate any of the external interrupt pins (RS, NMI, and INTx) with a 10-ns 
minimum pulse. If INTM = 0 when the wake-up interrupt takes place, the 54x 
enters the ISR when IDLE2 is terminated. If INTM = 1, the 54x continues with 
the instruction following IDLE 2 instruction. All wake-up interrupts must be set 
to enable the corresponding bits in the IMR register regardless of the INTM 
value. Reset all peripherals when IDLE2 terminates, especially if they are 
externally clocked. 


When RS is the wake-up interrupt in IDLE2, a 10-ns minimum pulse of RS can 
activate the reset sequence. 


The IDLE3 mode functions like IDLE2 but it also halts the PLL. IDLE3 is used 
for a complete shutdown of the 54x. This mode reduces power dissipation 
more than IDLE2. Furthermore, the IDLE3 state allows you to reconfigure the 
PLL externally if the system requires the 54x to operate at a lower speed to 
save power. 


Use the IDLE 3 instruction to enter the IDLE3 mode. To terminate IDLE3, 
activate any of the external interrupt pins (RS, NMI, and INTx) with a 10-ns 
minimum pulse. If INTM = 0 when the wake-up interrupt takes place, the ’54x 
enters the ISR when IDLE3 is terminated. If INTM = 1, the 54x continues with 
the instruction following the IDLE 3 instruction. All wake-up interrupts should 
be set to enable the corresponding bits on the IMR register, regardless of the 
INTM value. Reset all peripherals when IDLE3 terminates, especially if they 
are externally clocked. 


To terminate IDLE3, the external interrupt must be a minimum of 10 ns to 
activate the wake-up sequence. The ’54x can accept multiple interrupts during 
the wake-up sequence; the interrupt with highest priority is serviced first after 
IDLE3. See section 10.5.2, IDLE3, on page 10-26, and section 8.5.2, Soft- 
ware-Programmable PLL (Available on TMS320C545/546/548), on page 8-27 
for more details on PLL lockup time requirements. 
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When RS is the wake-up interrupt in IDLE3, a 10-ns minimum pulse of RS can 
activate the reset sequence. However, RS should be kept active for 50 us so 
that the PLL can secure and provide stable system clock to internal logic. 


6.11.4 Hold Mode 


The Hold mode is another power-down mode. It enables you to put the 
address, data, and control lines into the high-impedance state. Depending on 
the value of the HM bit, you can also use this mode to halt the CPU. 


This power-down mode is initiated by the HOLD signal. The effect of HOLD 
depends on the value of HM. If HM = 1, the CPU stops executing and address, 
data, and control lines go into the high-impedance state for further power 
reduction. If HM = 0, the address, data, and control signals are put into the 
high-impedance state, but the CPU continues to execute internally. You can 
use HM = 0 with the HOLD signal when your system does not require external- 
memory accesses. The ’54x continues to operate normally unless an off-chip 
access is required by an instruction; then the processor halts until HOLD is 
released. 


This mode does not stop the operation of on-chip peripherals (such as timers 
and serial ports); they continue to operate regardless of the HOLD level or the 
condition of the HM bit. 


This mode is terminated when HOLD becomes inactive. 


6.11.5 Other Power-Down Capabilities 


The ’54x has two other functions that affect the power-down operation: exter- 
nal bus off and CLKOUT off. 


(j External bus off allows the ’54x to disable the internal clock of external 
interfaces, thus placing the interface into a lower power-consumption 
mode. 


The external interface clock is disabled by setting bit 0 of the bank-switch- 
ing control register (BSCR) to 1. At reset, this bit is cleared to 0 and the 
external interface clock is enabled. See section 10.3.2, Bank-Switching 
Logic, on page 10-8, for more information. 


(j CLKOUT off allows the 54x to disable CLKOUT using software instruc- 
tions. The CLKOFF bit of PMST determines whether CLKOUT is enabled 
or disabled. See section 4.1.2, Processor Mode Status Register (PMST), 
on page 4-6. At reset, CLKOUT is enabled. 
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Pipeline 


This chapter describes the 54x pipeline operation and lists the pipeline latency 
cycles for operations with various registers. 
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7.1 Pipeline Operation 
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The '54x CPU has a six-level deep instruction pipeline. The six stages of the 
pipeline are independent of each other, which allows overlapping execution of 
instructions. During any given cycle, from one to six different instructions can 
be active, each at a different stage of completion. 


The six levels and functions of the pipeline structure are: 


L 


L 


Program prefetch. Program address bus (PAB) is loaded with the address 
of the next instruction to be fetched. 


Program fetch. An instruction word is fetched from the program bus (PB) 
and loaded into the instruction register (IR). This completes an instruction 
fetch sequence that consists of this and the previous cycle. 


Decode. The contents of the instruction register (IR) are decoded to deter- 
mine the type of memory access operation and the control sequence at 
the data-address generation unit (DAGEN) and the CPU. 


Access. DAGEN outputs the read operand’s address on the data address 
bus, DAB. If a second operand is required, the other data address bus, 
CAB, is also loaded with an appropriate address. Auxiliary registers in indi- 
rect addressing mode and the stack pointer (SP) are also updated. This 
is considered the first of the 2-stage operand read sequence. 


Read. The read data operand(s), if any, are read from the data buses, DB 
and CB. This completes the two-stage operand read sequence. At the 
same time, the two-stage operand write sequence begins. The data 
address of the write operand, if any, is loaded into the data write address 
bus (EAB). For memory-mapped registers, the read data operand is read 
from memory and written into the selected memory-mapped registers 
using the DB. 


Execute. The operand write sequence is completed by writing the data 
using the data write bus (EB). The instruction is executed in this phase. 


Figure 7-1 shows the six stages of the pipeline and the events that occur in 
each stage. 


The first two stages of the pipeline, prefetch and fetch, are the instruction fetch 
sequence. In one cycle, the address of a new instruction is loaded. In the 
following cycle, an instruction word is read. In case of multiword instructions, 
several such instruction fetch sequences are needed. 
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Figure 7-1. Pipeline Stages 


Loads PAB with 
the PC’s contents 


Loads IR with the contents Loads DB with the data1 
of PB read operand 
Decodes the IR’s contents Loads CB with the data2 


read operand 


Loads EAB with the data3 
write address, if required 


Loads PB with the Loads DAB with the data1 read Executes the instruction 
fetched instruction address, if required and loads EB with write 
word Loads CAB with the data2 read data 


address, if required 


Updates auxiliary registers and 
stack pointer 


Time > 


During the third stage of the pipeline, decode, the fetched instruction is 
decoded so that appropriate control sequences are activated for proper 
execution of the instruction. 


The next two pipeline stages, access and read, are an operand read 
sequence. If required by the instruction, the data address of one or two oper- 
ands are loaded in the access phase and the operand or operands are read 
in the following read phase. 


Any write operation is spread over two stages of the pipeline, the read and 
execute stages. During the read phase, the data address of the write operand 
is loaded onto EAB. In the following cycle, the operand is written to memory 
using EB. 


Each memory access is performed in two phases by the ’54x pipeline. In the 
first phase, an address bus is loaded with the memory address. In the second 
phase, a corresponding data bus reads from or writes to that memory address. 
Figure 7-2 shows how various memory accesses are performed by the ’54x 
pipeline. It is assumed that all memory accesses in the figure are performed 
by single-cycle, single-word instructions to on-chip dual-access memory. The 
on-chip dual-access memory can actually support two accesses in a single 
pipeline cycle. This is discussed in section 7.3, Dual-Access Memory and the 
Pipeline, on page 7-27. 
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Figure 7-2. Pipelined Memory Accesses 


(a) Instruction word fetch (one cycle) 


Prefetch Fetch Decode Access Read Execute/Write 


(b) Instruction performing single operand read (for example, LD *AR1, A; one cycle) 


Prefetch Fetch Decode Access Read Execute/Write 


(c) Instruction performing dual-operand read (for example, MAC *AR2+, *AR3+, A or DLD *AR2, A; one cycle) 


Prefetch Fetch Decode Access Read Execute/Write 
Load DAB Read from DB 
and CAB and CB 


(d) Instruction performing single-operand write (for example, STH A, *AR1; one cycle) 


Prefetch Fetch Decode Access Read Execute/Write 


(e) Instruction performing dual-operand write, (for example, DST A, *AR1; two cycles) 


Prefetch Fetch Decode Access Read Execute 
| | | | | Load EAB | Write to EAB | 
Prefetch Fetch Decode Access Read Execute/Write 


(f) Instruction performing operand read and operand write (for example, ST A, *AR2 || LD *AR3, B; one cycle) 


Prefetch Fetch Decode Access Read Execute/Write 
Read from DB . 
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The following sections provide examples that demonstrate how the pipeline 
works while executing different types of instructions. Unless otherwise noted, 
all instructions shown in the examples are considered single-cycle, single- 
word instructions residing in on-chip memory. 


The pipeline is depicted in these examples as a set of staggered rows in which 
each row corresponds to one instruction word moving through the stages of 
the pipeline. Example 7—1 is a sample pipeline diagram. 


Example 7-1. Sample Pipeline Diagram 


Address Instruction 
al, a2 Bb1 This is a four-cycle, two-word branch instruction 
a3 i3 This is any one-cycle, one-word instruction 
a4 i4 This is any one cycle, one-word instruction 
b1 ji 
1 2 3 4 5 6 7 8 9 10 
Prefetch Fetch Decode Access Read Execute 
Prefetch Fetch Decode Access Read Execute 
b1 PAB = a2 | PB=b1 IR=b1 b1 
Prefetch Fetch Decode Access Read Execute 
Prefetch Fetch Decode Access Read Execute 


Prefetch Fetch Decode Access Read Execute 


jt PAB=b1| PB=j1 | IR=j1 it 


Each row in the example is labeled on the left as an instruction, an operand, 
a multicycle instruction, or a pipeline flush. The numbers across the top repre- 
sent single instruction cycles. Some cycles do not show all pipeline stages— 
this is done intentionally to avoid displaying unnecessary information. 


Each box in the example contains relevant actions that occur at that pipeline 
stage. The name of each pipeline stage is shown above the box in which the 
action occurs. 


Shading represents all instruction fetches and pipeline flushes that are neces- 
sary to complete the instruction whose operation is shown. 
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7.1.1. Branch Instructions in the Pipeline 


Example 7-2 and Example 7-3 show the pipeline’s behavior during the 
execution of a branch (B) instruction and a delayed-branch (BD) instruction, 
respectively. 


Because a branch instruction consists of two instruction words, it should take 
at least two instruction cycles to execute completely. However, a standard 
branch instruction actually takes four cycles to execute. This is illustrated in 
Example 7-2. 


Example 7-2. Branch Instruction in the Pipeline 


Address Instruction 

al, a2 Bb1 

a3 i3 

a4 i4 

bi ji 

1 2 3 4 5 6 7 8 9 10 
Prefetch Fetch Decode Access Read Execute 
B PAB = a1 PB=B IR=B B 
Prefetch Fetch Decode Access Read Execute 
b1 PAB =a2| PB=b1 | IR=bt1 ess 
Prefetch Fetch Decode Access Read Execute 
Prefetch Fetch Decode Access Read Execute 
Pipeline flush PAB = a4 | PB=i4 
Prefetch Fetch Decode Access Read Execute 

jt PAB=b1] PB=j1 | IR=j1 | it 


For the branch instruction in Example 7—2 to execute completely, the following 
events occur: 


Cycle 1: The PAB is loaded with the address of a branch instruc- 
tion. 


Cycles 2 and 3: Two words of the branch instruction are fetched. 
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Cycles 4and 5: Twomoreinstructions, i3 and i4, are fetched. Although the 
two instructions after the branch instruction, i3 and i4, are 
fetched by the ’54x, they are not allowed to move past the 
decode stage and are eventually discarded. After the se- 
cond word of the branch instruction (represented by b1 in 
the left column) is decoded, PAB is loaded with this new 
value (in cycle 5). 


Cycles 6 and 7: The two-word branch instruction enters the execution 
stage of the pipeline in cycles 6 and 7. Also, j1 is fetched 
from address b1 in cycle 6. 


Cycles 8and9: These cycles are also consumed by the same branch 
instruction since the next two instructions, i3 and i4, were 
not allowed to complete their execution; this is why a 
branch instruction takes four cycles to execute. 


Cycle 10: j1 completes execution. 


Example 7-3 shows the pipeline’s behavior for a delayed-branch instruction. 


Example 7-3. Delayed-Branch Instruction in the Pipeline 


Address Instruction 

al, a2 BD b1 

a3 i3 

a4 i4 

b1 ji 

1 2 3 4 5 6 7 8 9 10 
Prefetch Fetch Decode Access Read Execute 
BD |PAB=ai1 ] PB=BD | IR=BD BD 
Prefetch Fetch Decode Access Read Execute 
b1 PAB = a2 | PB=b1 IR=b1 b1 
Prefetch Fetch Decode Access Read Execute 
Prefetch Fetch Decode Access Read Execute 

i4 PAB = a4 | PB=i4 | =i4 a re 


Prefetch Fetch Decode Access Read Execute 
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In this case, the pipeline behaves in the same manner as it did for the regular 
branch instruction. However, the two instructions following the branch, i3 and 
i4, are allowed to complete their execution. Therefore, only cycles 6 and 7 are 
consumed by the delayed-branch instruction, making the delayed branch into 
a 2-cycle instruction. 


7.1.2 Call Instructions in the Pipeline 
A standard call instruction takes four cycles to execute. Although a standard 
call is a two-word instruction and seems to need only two cycles, it actually 


flushes the pipeline for two cycles, taking four cycles to execute. 


Example 7—4 shows the pipeline’s behavior during the execution of a call 
instruction. 


Example 7-4. Call Instruction in the Pipeline 


Address Instruction 
al, a2 CALL b1 
a3 i3 
a4 i4 
b1 ji 
1 2 3 4 5 6 7 8 9 10 
Prefetch Fetch Decode Access Read Execute 


Prefetch Fetch Decode Access Read Execute 


PAB = 


Prefetch Fetch Decode Access Read Execute 


Prefetch Fetch Decode Access Read Execute 


Fy PAB = ' 


Prefetch Fetch Decode Access Read Execute 
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In Example 7—4, the following events occur: 


Cycle 1: 
Cycles 2 and 3: 
Cycle 4: 


Cycle 5: 


Cycles 6 and 7: 


Cycles 8 and 9: 


Cycle 10: 


PAB is loaded with the address of the call instruction. 
Two words of the call instruction are fetched. 


SP is decremented (represented by SP ——), because the 
return address is placed on the stack. The instruction i3 
is fetched; however, it is not allowed to move past the 
decode stage. 


The write address bus (EAB) is loaded with SP’s contents 
and the on-chip return register (RTN) is loaded with the 
return address, a3. After the second word of the call 
instruction (b1) is decoded, PAB is loaded with the new 
value in cycle 5 (shown in row j1). 


The RTN contents are written to the stack using EB in 
cycle 6. The instruction, j1, at address b1 is fetched in 
cycle 6. The two-word call instruction enters the execution 
stage of the pipeline in cycles 6 and 7. 


These cycles are consumed by the call instruction, 
because the next two instructions are not allowed to com- 
plete their execution. 


j1 completes execution. 
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Example 7—5 shows the pipeline behavior for a delayed-call instruction. 


Example 7-5. Delayed-Call Instruction in the Pipeline 


Address Instruction 
al, a2 CALLD b1 
a3 i3 
a4 i4 
b1 ji 
1 2 3 4 5 6 7 
Prefetch Fetch Decode Access Read Execute 
PB = FS EAB = SP 
CALLD |PAB=al] cain | CALLD BIN=aa eos ot 
Prefetch Fetch Decode Access Read Execute 
b1 PAB = a2 | PB=b1 IR=b1 b1 
Prefetch Fetch Decode Access Read 
i3 PAB=a3 | PB=i3 IR =i3 
Prefetch Fetch Decode Access 


Prefetch Fetch Decode 


jt PAB=b1 | PB=j1 | IR=j1 a | jt | 


In this case, the pipeline behaves in the same manner as with the normal call 
instruction. However, in this case the following two instructions, i3 and i4, are 
allowed to complete their execution. Therefore, only cycles 6 and 7 are 
consumed by the delayed-call instruction, making it a 2-cycle instruction. 
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Execute 


i3 


Read Execute 


Access Read Execute 
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The INTR instruction behaves like a CALL instruction. However, because 
INTR is a 1-word instruction, it can compute the vector table address and 
prefetch it one cycle earlier. As shown in Example 7—6, INTR takes only three 
cycles to execute. 


Example 7-6. INTR Instruction in the Pipeline 


Address Instruction 
al, a2 INTRn 

a3 i2 

a4 i3 


vect ji 
1 2 3 4 6 


Prefetch Fetch Decode Access Execute 


IR = INTR 
PAB = ai | PB=INTR RTN = a2 EAB = SP | EB=RITN 


Prefetch Fetch Decode Access Execute 


Pipeline flush PAB = a2 


Prefetch Fetch Decode Access Read Execute 


Prefetch Fetch Decode Access Read Execute 
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7.1.3 Return Instructions in the Pipeline 


Because a return is a single-word instruction, you would expect it to take at 
least one cycle to completely execute. In reality, a standard return instruction 
takes five cycles to execute. Example 7—7 shows the pipeline’s behavior 


during the execution of a return instruction. 


Example 7-7. Return Instruction in the Pipeline 


Address Instruction 
al RET 
a2 i2 
a3 i3 
b1 ji 
1 2 3 4 5 6 7 8 
Prefetch Fetch Decode Access Read Execute 


IR = SP++ 


Prefetch Fetch Decode Access Read Execute 
Prefetch Fetch Decode Access Read Execute 
Prefetch Fetch Decode Access Read 


Prefetch Fetch Decode Access 


Prefetch Fetch Decode 
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Pipeline flush eae =a3 | PB =i3 | | | | | 


Execute 


No No 


Read Execute 


No No 


Access Read Execute 
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In Example 7—7, the following events occur: 


Cycle 1: 


Cycle 2: 
Cycles 3 and 4: 


Cycle 5: 
Cycle 6: 


Cycles 7 and 8: 


Cycles 9 and 10: 


Cycle 11 


The PAB is loaded with the address of the return instruc- 
tion. 


The return instruction opcode is fetched. 


Two more instructions, i2 and i3, are fetched. Although 
these two instructions are fetched by the device, they are 
not allowed to move past the decode stage and are 
discarded. In cycle 4, SP is incremented (represented by 
SP ++) and DAB is loaded with the contents of SP in order 
to read the return address from the stack. 


The top of the stack is read using DB. 


The return instruction enters the execution stage of the 
pipeline. The address fetched from the stack is loaded 
onto PAB. This allows for fetching the next instruction, j1, 
from the return address. 


These cycles are consumed by the return instruction, 
because the next two instructions, i3 and i4, do not com- 
plete their execution. 


Because no instructions were fetched in cycles 4 and 5, 
cycles 9 and 10 are dummy cycles. 


j1 completes execution. 
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Example 7-8 shows the pipeline’s behavior during the execution of a delayed- 
return instruction. 


In a delayed-return instruction, the 54x pipeline behaves in the same way as 
with the normal return instruction. However, the following two instructions, i3 
and i4, are allowed to complete their execution, so only cycles 6, 7, and 8 are 
consumed by the delayed-return instruction, making it a 3-cycle instruction as 
shown in Example 7-8. 


Example 7-8. Delayed-Return Instruction in the Pipeline 


Address Instruction 
al RETD 
a2 i2 
a3 i3 
b1 ji 
1 2 3 4 5 6 7 8 9 10 11 
Prefetch Fetch Decode Access Read Execute 
PAB = PelS IR = SP++ 
RETD RETD | RETD |DAB=sp | 08=61 | RETD 
Prefetch Fetch Decode Access Read Execute 
fi : PAB = i 
Prefetch Fetch Decode Access Read Execute 
A : PAB = m 
Prefetch Fetch Decode Access Read Execute 
: No No . F 
. dts bi pee] | 7 
Prefetch Fetch Decode Access Read Execute 


: No No , ' 


Prefetch Fetch Decode Access Read Execute 
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Example 7—9 and Example 7—10 show the pipeline behavior for a return-with- 
interrupt-enable (RETE) instruction and a delayed return-with-interrupt- 
enable (RETED) instruction, respectively. The pipeline behavior for these 
instructions is similar to that of the standard return and delayed-return instruc- 
tions, respectively, and these instructions also take same number of cycles to 
execute. The difference is that these two instructions enable interrupts globally 
by resetting the INTM bit during the execute stage of the pipeline. 


Example 7-9. Return-With-Interrupt-Enable Instruction in the Pipeline 


Address Instruction 
al RETE 
a2 i2 
a3 i3 
b1 ji 
1 2 3 4 5 6 7 8 9 10 11 
Prefetch Fetch Decode Access Read Execute 
RENE DB =b1 | INTM=0 
Prefetch Fetch Decode Access Read Execute 
err PAB = : 
Prefetch Fetch Decode Access Read Execute 
ees PAB = . 
Pipeline flush PB =i3 
a3 
Prefetch Fetch Decode Access Read Execute 
Dummy cycle Io BY 
Wy prefetch fetch 
Prefetch Fetch Decode Access Read Execute 
No No 


Dummy cycle prefetch fetch 


Prefetch Fetch Decode Access Read Execute 


jt PAB =b1 | PB=j1 jt 
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Example 7-10. Delayed Return-With-Interrupt-Enable Instruction in the Pipeline 


Address Instruction 
al RETED 
a2 i2 
a3 i3 
b1 ji 
1 2 3 4 5 6 4 8 9 10 11 
Prefetch Fetch Decode Access Read Execute 
PAB = PB = IR = SP++ 
FEED RETED | RETED | DAB=sp | PB=51 | INTM=0 
Prefetch Fetch Decode Access Read Execute 
a A PAB = f 
Prefetch Fetch Decode Access Read Execute 
: i PAB = ‘ 
Prefetch Fetch Decode Access Read Execute 
: No No . : 
Prefetch Fetch Decode Access Read Execute 


: No No , . 


Prefetch Fetch Decode Access Read Execute 
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Example 7-11 and Example 7—12 show pipeline behavior for a return-fast 
(RETF) instruction and for a delayed return-fast (RETFD) instruction, respec- 
tively. The RETF instruction, unlike the RETE instruction, does not read the 
return address from the stack. Instead, it reads it from the RTN register. This 
allows the instruction to load PAB with the return address two cycles earlier 
than a RETE instruction can. As shown in the examples, the RETF instruction 
takes only three cycles to execute; the delayed version of the instruction, 
RETFD, executes in one cycle. 


Example 7-11. Return-Fast Instruction in the Pipeline 


Address Instruction 

al RETF 

a2 i2 

a3 i3 

b1 ji 

1 2 3 4 5 6 7 8 9 
Prefetch Fetch Decode Access Read Execute 
le IP 
Prefetch Fetch Decode Access Read Execute 
Prefetch Fetch Decode Access Read Execute 
Pipeline flush | PAB = a3 | PB =i3 | | | | | 
Prefetch Fetch Decode Access Read Execute 
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Example 7-12. Delayed Return-Fast Instruction in the Pipeline 


Address Instruction 

al RETFD 

a2 i2 

a3 i3 

b1 ji 

1 2 3 4 5 6 iG 8 9 
Prefetch Fetch Decode Access Read Execute 
PB = IR = SP++ 
RETFD |PAB=al | petrp | RETFD | Read RIN WS 
Prefetch Fetch Decode Access Read Execute 
Prefetch Fetch Decode Access Read Execute 
Prefetch Fetch Decode Access Read Execute 
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7.1.4 Conditional Execute Instructions in the Pipeline 


Because XC is single-word instruction, it takes at least one instruction cycle 
to completely execute. Example 7-13 shows pipeline behavior during the 
execution of XC. 


Example 7-13. XC Instruction in the Pipeline 


Address Instruction 
al if 
a2 i2 
a3 i3 
a4 XC 2, cond 
a5 id 
a6 i6 
1 2 3 4 5 6 7 8 9 10 11 
Prefetch Fetch Decode Access Read Execute 


Prefetch Fetch Decode Access Read Execute 
Prefetch Fetch Decode Access Read Execute 
Prefetch Fetch Decode Access Read Execute 
XC PAB = a4 | PB=XC | IR=XC evauate} | 
Prefetch Fetch Decode Access Read Execute 
i5 or NOP PAB = a5 | PB=i5 Conditional execution of i5 
Prefetch Fetch Decode Access Read Execute 


i6 or NOP | PAB = a6 [ re=i6 | Conditional execution of i6 | 


In Example 7-13, the following events occur: 


Cycle 4: The PAB is loaded with the address of the XC instruction. 
Cycle 5: The XC instruction opcode is fetched. 


Cycle 7: When the XC instruction moves into the access stage of the pipe- 
line in cycle 7, any conditions specified by the XC instruction are 
evaluated. If the tested conditions are true, the next two instruc- 
tions, i5 and i6, are decoded and allowed to execute. However, 
if the tested conditions are false, i5 and i6 are not decoded. 
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To execute XC in one cycle, the CPU evaluates test conditions in the access 
stage of the pipeline. This means that the two 1-word instructions (or one 
2-word instruction) immediately prior to the XC instruction will not have com- 
pletely executed before the conditions are tested. Because the condition 
codes are affected only by instructions in the execute stage, those two instruc- 
tions have no effect on the operation of XC. 


7.1.5 Conditional-Call and Conditional-Branch Instructions in the Pipeline 
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Because a call instruction consists of two instruction words, you would expect 
it to take at least two cycles to execute completely. A standard conditional-call 
instruction actually takes either five cycles to execute if the call is taken or three 
cycles to execute if the call is not taken. 


Example 7—14 shows pipeline behavior during the execution of a conditional- 
call instruction (CC). 


A conditional-call instruction is similar in its pipeline behavior to an uncondi- 
tional-call instruction. The only exception is that the test conditions for a condi- 
tional-call instruction are evaluated in the execute stage of the pipeline. As 
shown in Example 7—14, when the test conditions are evaluated in cycle 7, the 
previous instruction, i1, has completely executed. Furthermore, the next two 
instructions after CC, i4 and i5, are also fetched. If the test conditions are 
evaluated to be false, these two instructions proceed through the pipeline. 
Otherwise, they are discarded. The instruction prefetch in cycle 7 is also 
dependent on the evaluated conditions. If the conditions are true, PAB is 
loaded with the call address (b1); otherwise, it is loaded with the next incre- 
mental address (a6). 


If the evaluated conditions are true, i4 and i5 do not execute in cycles 10 and 
11. In this case, the CC instruction becomes a 5-cycle instruction. However, 
if the evaluated conditions are false, i4 and i5 execute, making CC a 3-cycle 
instruction. 


Example 7-15 shows pipeline behavior during the execution of a delayed 
conditional-call (CCD) instruction. 


The pipeline behaves in the same manner as it does for the CC instruction. 
However, the following two instructions, i3 and i4, are allowed to complete their 
execution regardless of whether the tested conditions are true or not. Only 
cycles 7, 8, and 9 are consumed by the CCD instruction, making it a 3-cycle 
instruction. 
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Example 7—14. CC Instruction in the Pipeline 
Address Instruction 
al i1 
a2,a3 CC b1, cond 
a4 i4 
a5 i 
a6 


b1 
1 2 3 4 5 6 
Prefetch Fetch Decode Access Read Execute 


Prefetch Fetch Decode Access Read Execute 


EAB=SP | EB=RIN 


Prefetch Fetch Decode Access Read Execute 


Prefetch Fetch Decode Access Read Execute 


Prefetch Fetch Decode Access Read Execute 


F feral No No =A rn F F 
i4 or pipeline flush prefetch IR = i4 and conditional execution of i4 


Prefetch Fetch Decode Access Read Execute 


i5 or pipeline flush PAB = a5 IR = i5 and conditional execution of i5 


Prefetch Fetch Decode Access Read Execute 


PAB = PB= IR = ; ; 
es | Si [ce || [em 
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Example 7-15. CCD Instruction in the Pipeline 


Address 
al 
a2, a3 
a4 
a5 
a6 
b1 

Fi 
Prefetch 


CCD 


b1 


Pipeline flush 


i6 or j1 
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Instruction 
i1 
CCD b1, cond 
i4 
i5 
i6 
ji 
2 8 4 
Fetch Decode Access 


Prefetch Fetch Decode 


EB = 
PAB=a2 |PB=CCD | IR=ccp | sp-- | EAR=SP | arn 
z Evaluate 


Prefetch 


Fetch 


Prefetch 


Example 7-16 and Example 7-17 show the pipeline’s behavior during the 
execution of a conditional branch (BC) instruction and a delayed conditional 


5 
Read 


Access 


Decode 


Fetch 


Prefetch 


No No 2% 
Pere Pas fee] ff | 


6 
Execute 


Read 


Access 


Decode 


Fetch 


Prefetch 


branch (BCD) instruction. 


The behavior of the conditional branch (BC) and the delayed conditional 
branch (BCD) instructions in the pipeline is similar to that of the CC and CCD 
instructions, respectively. The difference is that no return address is written to 
the stack in this case. As shown in Example 7—16, a BC instruction takes either 
three or five cycles to execute, depending on whether or not the branch is 


Execute 


Read 


Access 


Decode 


Fetch 


Prefetch 


Execute 


Read 


Access 


Decode 


Fetch 


Execute 


Read 


Access 


Decode 


taken. A BCD instruction executes in three cycles. 


Execute 


Read 


Access 


Execute 


Read 


Execute 


PAB = PB = IR = iecates 
ee | ei |e | | [oo | 
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Example 7-16. BC Instruction in the Pipeline 


Address Instruction 

al i1 

a2,a3 BC b1, cond 

a4 i4 

a5 id 

a6 i6 

b1 jl 

1 2 3 4 5 6 7 8 9 10 11 12 

Prefetch Fetch Decode Access Read Execute 


Prefetch Fetch Decode Access Read Execute 
=== 
Prefetch Fetch Decode Access Read Execute 
pale TT 1 
Prefetch Fetch Decode Access Read Execute 
~e rT 1 TT 
Prefetch Fetch Decode Access Read Execute 
Prefetch Fetch Decode Access Read Execute 


Prefetch Fetch Decode Access Read Execute 


ee PAB = PB = IR = ey 
i6 or j1 a | Son feo ||| eo 
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Example 7-17. BCD Instruction in the Pipeline 


Address Instruction 

al i1 

a2,a3 BCD b1, cond 

a4 i4 

a5 id 

a6 i6 

b1 jl 

1 2 3 4 5 6 7 8 9 10 11 12 
Prefetch Fetch Decode Access Read Execute 
Prefetch Fetch Decode Access Read Execute 
BCD 
Prefetch Fetch Decode Access Read Execute 
Prefetch Fetch Decode Access Read Execute 
Guiacalid ie ee 
Prefetch Fetch Decode Access Read Execute 
. No No -i 
‘ ileal Pas feet of fe | 
Prefetch Fetch Decode Access Read Execute 

, PAB = = i 5 
i | | 


Prefetch Fetch Decode Access Read Execute 


ee PAB = PB = IR = mee 
i6 or j1 ia | Sen [et | | feo | 


7-24 


Interrupts and the Pipeline 


7.2 Interrupts and the Pipeline 
Example 7-18 shows the pipeline behavior when an interrupt is taken. 


As shown in Example 7-18, if an interrupt is serviced at the end of cycle 3, an 
INTR instruction is automatically placed into the decode stage of the pipeline 
during the next cycle (4). The instruction, i2, is not decoded because the INTR 
instruction is placed into the pipeline at that stage. During the next three cycles, 
the instructions that have already been decoded are executed. Cycles 7, 8, 
and 9 are taken by the INTR instruction. The first instruction in the ISR, 
RETFD, is executed in cycle 10. Cycles 11 and 12 are consumed by the two 
1-word instructions that constitute the two delay slots of the RETFD instruc- 
tion. In the following cycle, instruction i2 is executed, completing the return 
from the ISR. 


As shown in the figure, interrupt overhead (the number of cycles required to 
branch to an ISR) is only three cycles. The return from the interrupt takes only 
one cycle because RETFD is a single-cycle instruction. Because only four 
words are reserved for each interrupt in the interrupt vector table, if an ISR 
requires more than four instruction words, it must be located elsewhere. In this 
case, a branch type instruction in the interrupt vector table. This would result 
in a slightly higher interrupt overhead. 


Pipeline 7-25 


Interrupts and the Pipeline 


Example 7-18. Interrupt Response by the Pipeline 


Address Instruction 
al i1 ; Interrupt taken after executing this instruction 
a2 i2 
a3 i3 
a4 i4 
vectt RETFD ; First instruction in the interrupt vector table 
vect2 ji 
vect3 j2 
1 2 3 4 5 6 7 8 9 10 an 12 13 
Prefetch Fetch Decode Access Read Execute 


Prefetch Fetch Decode Access Read Execute 


INTR 
(inserted) 


PAB = a2 Be NTR EAB=SP | EB=RTN 


Prefetch Fetch 


Prefetch Fetch 


Decode Access Read Execute 


Pipeline flush 


Decode Access Read Execute 


Pipeline flush 


Pre- 
fetch Fetch 


PAB= | _PB= IR= SP. _ 
RETFD RETED | RETED | RdRTN | | INTM = 0 


Prefetch Fetch 


Decode Access Read Execute 


Decode Access Read Execute 


Prefetch Fetch Decode Access Read Execute 


, PAB = : = . 


Prefetch Fetch Access Read 


ree[eetee[ T [=| 


Decode Execute 
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7.3 Dual-Access Memory and the Pipeline 


The ’54x features on-chip memory that supports two accesses in a single 
cycle. This dual-access memory is organized as several independent memory 
blocks. Simultaneous accesses to different blocks are supported with no 
conflicts: while one instruction in the pipeline accesses one block, another 
instruction at the same stage in the pipeline can access a different block with- 
out conflict. Furthermore, each memory block supports two accesses in a 
single cycle: two instructions, each in different stages of the pipeline, can 
access the same block simultaneously. However, a conflict can occur when 
two simultaneous accesses are performed on the same block. The ’54x CPU 
resolves these conflicts automatically; this is discussed later in this section. 
Table 7—1 shows the block size and number of blocks for each ’54x device. For 
more information about DARAM organization, see section 3.3.2, On-Chip 
RAM Organization, on page 3-23. 


Table 7-1. DARAM Blocks 


‘Device ~~—~—<Block Sizet_ ——~—SsNumber of Blocks _ 
“541 s—i“‘éOSOC#C#*#*dKWords:=”))))))™))™06UOUSDU 
542 2K words 5 

543 2K words 5 

B45 2K words 3 

B46 2K words 3 

548 2K words 4 

B49 2K words 4 

5402 8K words 2 

5410 2K words 4 

5420 (each subsystem) 8K words 2 


T Note that the first block is slightly smaller due to the memory-mapped registers and the scratch- 
pad RAM. 


Each dual-access memory block supports two accesses in one cycle by per- 
forming one access in the first half-cycle and the other in the next half-cycle. 
Table 7-2 lists the accesses performed in each half-cycle, and Figure 7-3 
shows how the different types are performed. Address bus loads are omitted 
from the diagram for simplicity. 
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Table 7-2. Accessing DARAM Blocks 


This type of access ... Is performed in the ... 
Instruction fetch using PAB/PB First half-cycle 

First data operand read using DAB/DB First half-cycle 
Second data operand read using CAB/CB Second half-cycle 
Data operand write using EAB/EB Second half-cycle 


Figure 7-3. Half-Cycle Accesses to Dual-Access Memory 


(a) Instruction word fetch 


Prefetch Fetch Decode Access Read Execute 
Read 
PB 


(b) Instruction performing single-operand read 


Prefetch Fetch Decode Access Read Execute 
Read 
DB 


(c) Instruction performing dual-operand read 


Prefetch Fetch Decode Access Read Execute 
Read | Read 
CB DB 


(d) Instruction performing single-operand write 


Prefetch Fetch Decode Access Read Execute 


Write 
EB 
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(e) Instruction performing dual-operand write (two cycles) 


Prefetch Fetch Decode Access Read Execute 
Write 
EB 
Prefetch Fetch Decode Access Read Execute 


Write 
EB 


(f) Instruction performing operand read and operand write 


Prefetch Fetch Decode Access Read Execute 
Read Write 
DB EB 


Because two types of access are scheduled and only one access is performed 
in each half-cycle, conflicts can occur. These conflicts are automatically 
resolved by the CPU either by rearranging the order of accesses or by delaying 
an access by one cycle. The following sections describe these resolved 
memory access conflicts. Keep in mind that these conflicts appear only if all 
accesses are being performed on the same dual-access memory block. 


7.3.1. Resolved Conflict Between Instruction Fetch and Operand Read 


If a dual-access memory block is mapped in both program and data spaces, 
an instruction fetch will conflict with a data operand read access if they are per- 
formed onthe same memory block. The 54x resolves this conflict automatical- 
ly by delaying the instruction fetch by one cycle, as shown in Example 7-19. 
Inthe figure, it is assumed that instructions i2 and i3 do not access the dual-ac- 
cess memory block where the code resides. 
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Example 7-19. Instruction Fetch and Operand Read 


LD *AR2+, A; AR2 is pointing to the same DARAM block where the code resides. 


i2 
i3 
i4 


Prefetch Fetch Decode Access Read Execute 


LD*AR2+, 


Prefetch Fetch Decode Access Read Execute 


supposed to occur here) 


Prefetch Fetch Decode Access Read Execute 


Read 
PB 


Prefetch Fetch Decode Access Read Execute 


Suppo Si. ccurhere) _ = ANS Pf 


i4 (delayed instruction fetch) 


Legend: 


Prefetch Fetch Decode Access Read Execute 


Where a memory access actually 


Where instruction fetch is supposed to occur ecclire 


7.3.2 Resolved Conflict Between Operand Write and Dual-Operand Read 
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Another conflict arises if a single-operand write instruction is followed by an 
instruction that does not perform a write access and this instruction is followed 
by a dual-operand read instruction. This is shown in Example 7—20, in which 
AR3 and AR5 point to the same dual-access memory block. 


There is aconflict between the operand write access (EB bus) and the second 
data read access (CB bus). This conflict is resolved automatically by delaying 
the write access by one cycle. The actual execution time of these instructions 
does not increase, because the delayed write access is performed while the 
second instruction is in the execute stage. 


If any read access (via DB or CB) is from the same memory location in on-chip 
memory where the write access should occur, the CPU bypasses reading the 
actual memory location; instead, it reads the data directly from an internal bus. 
This allows the pipeline to perform a write access in a later pipeline stage than 
that in which the next instruction reads from the same memory location. 


Dual-Access Memory and the Pipeline 


Example 7-20. Operand Write and Dual-Operand Read Conflict 


STLA, *AR3+ 
LD #0,A 
ADD *AR4+, *AR5+, A ; AR3 and ARS both point to the same dual-access memory block. 


Prefetch Fetch Decode Access Read Execute 


STL A, *AR3+ , 
nse sei ieee ee ee! Pot tft SAS 


Prefetch Fetch Decode Access Read Execute 
LD #0, A (write access of previous : 
een a) rok’ ft ee 


Prefetch Fetch Decode Access Read Execute 


ADD *AR4+, AR5, A 


Legend: Where an access is supposed to occur Where an access actually occurs 


7.3.3. Resolved Conflict Among Operand Write, Operand Write, and Dual-Operand 
Read 


If the second instruction in the case described above is an operand-write type 
instruction, then the write access requested by the first instruction cannot be 
moved to the next cycle. The CPU resolves the conflict by inserting a dummy 
cycle after the first instruction. This is illustrated in Example 7-21, in which 
AR3 and AR&d point to the same dual-access memory block. 


Pipeline 7-31 


Dual-Access Memory and the Pipeline 


Example 7-21. Operand Write and Operand Read Conflict 


STLA, *AR3+ 
STH A, *AR2 
ADD *AR4+, *AR5+, A ; AR3 and ARS both point to the same dual-access memory block 


Prefetch Fetch Decode Access Read Execute 


Prefetch Fetch Decode Access Read Execute 


a = [| [| | [ | [ | 


Prefetch Fetch Decode Access Read Execute 


Prefetch Fetch Decode Access Read Execute 


Legend: Where read or write occurs 
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7.4 Single-Access Memory and the Pipeline 


The ’54x also features on-chip single-access memory that supports one 
access per cycle to each memory block. There are two different types of single- 
access memory that are available on ’54x devices: 


_] Single access read-write memory (SARAM) 
_] Single-access read-only memory (ROM or DROM) 


Both types of single-access memory behave similarly in terms of pipelined 
accesses, with the exception that ROM and DROM cannot be written to. These 
memory blocks are contiguous in memory with the first block beginning at the 
start address of SARAM or ROM. For more information about memory block- 
ing, see section 3.2.2, On-Chip ROM Organization, on page 3-17, and section 
3.3.2, On-Chip RAM Organization, on page 3-23. 


Simultaneous accesses with no conflicts are supported by single-access 
memory as long as the access are to different memory blocks; while one 
instruction in a pipeline stage accesses one memory block, another instruction 
can access a different memory block in the same cycle without any conflict. 


A conflict can occur when two simultaneous accesses are performed on the 
same memory block. In case of such a conflict, only one access is performed 
in that cycle and the second access is delayed until the following cycle. This 
results in a one-cycle pipeline latency. 


A pipeline conflict due to single access memory may occur in several different 
situations. 


(j) Dual-Operand instructions. Many instructions have two memory oper- 
ands to read or write data. If both operands are pointing to the same single- 
access memory block, a pipeline conflict occurs. The CPU automatically 
delays the execution of that instruction by one cycle to resolve the conflict. 
For example: 


MAC *AR2+, *AR3+%,A,B ; This instruction will take two 
; cycles if both operands are in 
; Same SARAM or DROM block. 


Lj 32-bit operand instructions. Instructions that read 32-bit memory oper- 
ands still take only one cycle to execute, even if their operand is in single- 
access memory. Single-access memory blocks are designed to allow a 
32-bit read to occur in one cycle. Instructions that write 32-bit operands 
take two cycles to execute. 


DLD *AR2, A ; This instruction only takes 1 cycle even 
; if the operand is in single-access memory. 
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(j Read-write conflict. If an instruction that writes to a single-access memory 


block is followed by an instruction that reads from the same single-access 
memory block, a conflict occurs because both instructions try to access 
the same memory block simultaneously. In this case, the read access is 
delayed automatically by one cycle. For example: 


STL A, *AR1+ ; AR1 and AR3 points at the same SARAM 
; block. 
LD *AR3, B ; This instruction takes 1 additional 
; cycle due to a memory access conflict. 


On the other hand, a dual-operand instruction that has a read operand and 


awrite operand does not cause this conflict because the two accesses are 
done in two different pipeline stages. For example: 


ST A, *AR2+ ; This instruction does not take any 
| | ADD *AR3+, B ; extra cycles, even if AR2/AR3 point 
; at the same single access memory 
; block. 


Code-data conflict. Another type of memory access conflict can occur 
when SARAM or ROM is mapped in both program and data spaces. In this 
case, if instructions are fetched from a memory block and data accesses 
(read or write) are also performed on the same memory block, the instruc- 
tion fetch is delayed by one cycle. For example: 


LD *ARI+, A ; This read data access delays a 
; subsequent instruction fetch. 

STH A, *AR2 ; This write data access delays a 
; subsequent instruction fetch 


This situation causes significantly higher pipeline latency than the cases 
described previously. This is because each time there is a read or write 
access to the memory block, the pipeline is stalled for one cycle. Itis gen- 
erally recommended that each single-access memory block be reserved 
for either data or program storage to avoid hits each time a data access is 
made to that block. 
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7.5 Pipeline Latencies 


The ’54x pipeline allows multiple instructions to access CPU resources simul- 
taneously. Because CPU resources are limited, conflicts can occur when one 
CPU resource is accessed by more than one pipeline stage. Some of these 
pipeline conflicts are resolved automatically by the CPU by delaying accesses. 
Other conflicts are unprotected and must be resolved by the programmer. 


In general, unprotected conflicts are resolved by rearranging instructions or by 
inserting NOP instruction (no operation performed). They can also be avoided 
by using only instructions that do not create any pipeline conflicts or by observ- 
ing necessary delays before certain registers are accessed. 


7.5.1 Recommended Instructions for Accessing Memory-Mapped Registers 


Unprotected pipeline conflicts can occur when any one of the following 
memory-mapped registers is accessed: 


Auxiliary registers (ARO — AR7) 

Block size register (BK) 

Stack pointer (SP) 

Temporary register (T) 

Processor mode status register (PMST) 

Status registers (STO and ST1) 

Block-repeat counter register (BRC) 

Memory-mapped accumulator registers (AG, AH, AL, BG, BH, BL) 


OOUUOUOUUULU 


However, certain instructions can access these registers without causing pipe- 
line conflicts if you observe appropriate latency cycles. Table 7-3 lists these 
instructions. 


Table 7-3 is valid only if programmers limit themselves to those instructions 
that are listed in column 3 in order to perform functions listed in column 2. 
Otherwise, refer to the following sections to find the latency of each individual 
instruction. Furthermore, this table is provided as a quick reference for pipeline 
latencies. It does not describe all possible pipeline latencies, nor does it 
provide detailed information about latencies. 
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Table 7-3. Recommended Instructions for Accessing Memory-Mapped 


Registers 


Catt Function 


{ 


Writing to ARx/BK with- 
out using an accumulator 


2 Writing to ARx/BK using 
an accumulator 

8} Popping ARx/BK from 
stack 

4 Writing to SP without 
using an accumulator 

5 Writing to SP using an 
accumulator 

6 Writing to T without 
using an accumulator 

7 Writing to T using an 
accumulator 

8 Writing to BRC without 
using an accumulator 

9 Writing to BRC using an 
accumulator 

t Category 


Instruction(s) 


STM 

MVDK 
MVMM 
MVMD 


STLM 

STH 

STL 

Store typet 


POPM 


STM 

MVDK 
MVMM 
MVMD 


STLM 

STH 

Sib 

Store typet 


STM 

MVDK 

LD Smem,T 

LD Smem,T || ST 


STLM 
STH 
STL 


STM 
MVDK 


STLM 

STH 

Syl 

Store typet 


Latency 


ARx update: None 
BK update: The next 
word must not use 
circular addressing 


The next 2 words 
(ARx) or 3 words (BK) 
must not use the same 
register. 


The next 1 word (ARx) 
or 2 words (BK) must 
not use the same reg- 
ister. 


None if CPL = 0 


The next 1 word must 
not use SP if CPL = 1#. 


The next 2 (if CPL = 0) 
or 3 (if CPL = 1) words 
must not use SP#. 


None 


The next word must 
not use T. 


None 


The next instruction 
must not be a 
RPTB[D]. 


Additional Restrictions 


None 


The next instruction 
must not write to any 
ARx, BK, or SP using 
STM, MVDK, or MVMD. 


Do not precede a cate- 
gory 3 instruction with any 
category 2 or 5 instruction 


that writes to any ARx, 
BK, or SP. 


None 


The next instruction must 
not write to ARx, BK, or 
SP using STM, MVDK, or 
MVMD. 


None 


None 


None 


None 


t Any other store-type instruction. See Table 7-5 on page 7-40 for a list of store-type instructions. 
# Interrupts cause an update of SP. This update of SP can interfere with a previous write to SP. Therefore, special considerations 
must be made when using interrupts while executing instructions that update SP. 
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Table 7-3. Recommended Instructions for Accessing Memory-Mapped 
Registers (Continued) 


Catt Function Instruction(s) Latency Additional Restrictions 
10 Writing to ARP LD #k, ARP None None 
t Category 


t Any other store-type instruction. See Table 7-5 on page 7-40 for a list of store-type instructions. 
# Interrupts cause an update of SP. This update of SP can interfere with a previous write to SP. Therefore, special considerations 


must be made when using interrupts while executing instructions that update SP. 
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Table 7-3. Recommended Instructions for Accessing Memory-Mapped 
Registers (Continued) 


Catt Function 

11 Writing to DP 

12 Writing to CPL 

13 Writing to SXM 

14 Writing to ASM 

15 Writing to BRAF 

16 Writing BRC to memory 

17 Writing to OVLY, 
MP/MC, or IPTR 

18 Writing to DROM bit 

19 Calculating an 
exponent 

20 Stack manipulation in 
compiler mode (CPL = 1) 

t Category 


Instruction(s) 


LD #k, DP 
LD Smem, DP 


RSBX 
SSBX 


RSBX 
SSBX 


LD #k, ASM 
LD Smem, ASM 


RSBX 
SSBX 


SRCCD 


ANDM 
ORM 
XORM 


ANDM 
ORM 
XORM 


EXP 
FRAME 


POPM/POPD 
PSHM/PSHD 


Latency 


None 


The next 3 words must 
not use direct addres- 
sing mode. 


The next word must 
not be affected by SXM 
status. 


None 


The next 5 words must 
not contain the last 
instruction word in the 
RPTB loop. 


The next 2 words must 
not contain the last 
instruction word in the 
RPTB loop. 


The next 6 cycles 
must not include an 
instruction fetch from 
the on-chip memory’s 
address range. 


The next 3 words must 
not access the DROM’s 
address range. 


The next instruction 
must not use T. 


The next instruction 
must not use direct 
addressing mode 
(CPie—ai)): 


Additional Restrictions 


None 


None 


None 


None 


None 


None 


An external-bus cycle 
may cause additional 
latency. 


An external-bus cycle 
may cause additional 
latency. 


None 


None 


t Any other store-type instruction. See Table 7-5 on page 7-40 for a list of store-type instructions. 
# Interrupts cause an update of SP. This update of SP can interfere with a previous write to SP. Therefore, special considerations 
must be made when using interrupts while executing instructions that update SP. 
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Table 7-3. Recommended Instructions for Accessing Memory-Mapped 
Registers (Continued) 


Catt Function Instruction(s) Latency Additional Restrictions 
21 Reading AG, AH, AL, Anyinstructionthatcan The previous instruc- None 

BG, BH, orBLasmem- __ read from memory tion must not modify 

ory-mapped registers accumulator A_ or 


accumulator B. 


Category 

t Any other store-type instruction. See Table 7—5 on page 7-40 for a list of store-type instructions. 

# Interrupts cause an update of SP. This update of SP can interfere with a previous write to SP. Therefore, special considerations 
must be made when using interrupts while executing instructions that update SP. 


7.5.2 Updating ARx, BK, or SP—A Resolved Conflict 


Table 7—4 lists 54x instructions that update data-address generation logic 
(DAGEN) registers in the read stage of the pipeline. The DAGEN registers are 
the auxiliary registers (ARx), the block size register (BK), and the stack pointer 
(SP). 


All other instructions that write to these registers perform their writes in the 
execute stage and are store-type instructions. They are listed in Table 7—5. 


Table 7-4. Instructions That Access DAGEN Registers in the Read Stage 


Instruction Type Instructions 
Constant initialization STM #lk, MMR 

ST #lk, SmemtT.+ 
Move type 1 MVDD Xmem, Ymemt 


POPM MMR 
POPD Smemt.t 
DELAY Smemt.t 


Move type 2 MVDK Smem, dmadt 
MVMD MMR, dmadt 


t This operand must be pointing to one of the DAGEN registers. 
+ DP must be 0 to access DAGEN registers. 
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Table 7—5. Store-Type Instructions 


Instruction Instruction Instruction 

MVKD ~~ dmad, Smem STL src, SHFT, Xmem SRCCD Xmem, cond 
MVDM_— dmad, MMR STL src, SHIFT, Smem STRCD Xmem, cond 
MVPD = dmad, Smem ST || ADD CMPS _ src, Smem 
STH src, Smem ST || LD ST T, Smem 
STH src, ASM, Smem ST || LT ST TRN, Smem 
STH src, SHFT, Xmem ST || MAC[R] ADDM _ Smen, #lk 
STH src, SHIFT, Smem ST || MAS[R] ANDM — Smen, #lk 
STLM src, MMR ST || MPY ORM #ik, Smem 
STL src, Smem ST || SUB XORM ~ Smen, #lk 
STL src, ASM, Smem SACCD - src, Xmem, cond 


When astore-type instruction is immediately followed by an instruction that up- 
dates ARx, BK, or SP in the read stage, a conflict can occur, because both 
instructions try to access DAGEN registers. The DAGEN register set can be 
written to only once in a given cycle, so the CPU delays the read stage access 
by one cycle. This access is performed when the second instruction is in the 
execute stage of the pipeline. This generally does not affect the execution time 
of that instruction. Example 7-22, Example 7-23, and Example 7-24 show 
this conflict. 


Pipeline Latencies 


Example 7-22. Resolving Conflict When Updating Multiple ARxs 
(a) Updating AR1 in execute stage and ARZ2 in read stage 


STLM A, ARI ; This instruction updates AR1 in the Execute stage. 

STM #1, AR2 This instruction tries to update AR2 in its read stage, 
creating a conflict with the previous instruction. 

The AR2 update is delayed by one cycle. 


Ne Ne Ne SN 


1 2 3 4 5 6 7 8 


Prefetch Fetch Decode Access Read Execute 
STLM A, AR1 Write to 
AR1 
Prefetch Fetch Decode Access Read Execute 
STM #1, AR2 (1st word) Write to | Write to 
(write delayed by 1 cycle) AR2 AR2 
Prefetch Fetch Decode Access Read Execute 


Legend: Where a write conflict Where the write actually occurs 
occurs 
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Example 7-22. Resolving Conflict When Updating Multiple ARxs (Continued) 
(b) Updating AR1 in execute stage, AR2 in read stage, and AR3 in read stage 


STH A, AR1 ; This instruction updates AR1 in the execute stage. 


STM #1, AR2 ; This instruction tries to update AR2 in its read stage, 
7 Causing a conflict. The update is delayed by one cycle. 


STM #2, AR3 ; This instruction updates AR3 in its read stage. It 
: creates no conflict since the previous instruction was 
7 a two-word instruction. 


1 2 3 4 5 6 7 8 9 10 


Prefetch Fetch Decode Access Read Execute 


STH A, AR1 Write 
to AR1 


Prefetch Fetch Decode Access Read Execute 


STM #1, AR2 (1st word) 
(write delayed by 1 cycle) 


Prefetch Fetch Decode Access Read Execute 


Prefetch Fetch Decode Access Read Execute 


to AR3 


Prefetch Fetch Decode Access Read Execute 


Legend: Where a write conflict occurs Where the write actually occurs 
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Example 7-23. Resolving Conflict When Updating ARx and BK 


STLM B, BK 
MVDK 200h, ARI 


Ne Ne Ne Ne 


STLM B, BK 


Legend: 


MVDK 200h, AR1 (1st word) 
(write delayed by 1 cycle) 


This instruction updates BK in the execute stage. 
This instruction tries to update AR1 in it read 
stage. The CPU delays this update by one cycle to 
resolve the conflict. 


1 2 3 4 5 6 7 
Prefetch Fetch Decode Access Read Execute 
Write to 
BK 
Prefetch Fetch Decode Access Read Execute 


Prefetch Fetch Decode Access Read 


MVDK 200h, AR1 (2nd word) 


Where a write conflict oc- Where the write actually occurs 
curs 


Pipeline 


Execute 
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Example 7-24. Resolving Conflict When Updating SP, BK, and ARx 


STLM A, SP ; This instruction updates SP in the execute stage. 


POPM BK ; This instruction tries to update BK in its read stage. 
; The CPU delays this update by one cycle. 


STM #1, AR1 ; This instruction tries to update AR1 in its read stage, 
, conflicting with the previous instruction. 


; The update is delayed by one cycle. 


1 2 3 4 5 6 7 8 9 10 


Prefetch Fetch Decode Access Read Execute 


STLM 
A, SP 


POPM BK (write 
delayed by 1 cycle) 


STM #1, AR1 (1st word) 
(write delayed by 1 cycle) 


Prefetch Fetch Decode Access Read Execute 
Legend: Where a write conflict Where the write actually occurs 


occurs 


These conflicts are automatically resolved by the ’54x CPU. This generally 
does not affect instruction-execution behavior. However, there is one case in 
which resolution by the CPU can cause an unprotected pipeline conflict. This 
is explained in section 7.5.3, Rules to Determine DAGEN Register Access 
Conflicts. 
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7.5.3 Rules to Determine DAGEN Register Access Conflicts 


Some instructions update DAGEN registers in the read stage. This can result 
in conflict if the previous instruction tries to update a DAGEN register in its 
execute stage. This conflict is resolved automatically by the CPU’s delaying 
the read stage update by one cycle. This delay can cause an additional cycle 
of latency between the instruction that writes to the DAGEN register in its read 
stage and the instruction that follows it. 


The following set of conditions determines when such a conflict can occur: 


(1 The first instruction is one of two types: 


M Store-type instruction that accesses any DAGEN register to load a 
new value (see Table 7—5 on page 7-40) 


m Move type 1 instruction (see Table 7—4) that has a DAGEN register 
conflict with the previous instruction. 


(J The second instruction is a constant-initialization-type instruction, amove 
type 1 instruction, or a move type 2 instruction (see Table 7—4) that writes 
to BK, SP, or any ARx. The instruction must not use a long offset modifier 
(see Example 7-27). 


(1 ~The third instruction uses the same register as the second instruction in 
indirect addressing mode. 
7.5.4 Latencies for ARx and BK 


An unprotected pipeline conflict can occur when accessing an auxiliary regis- 
ter or BK when both of the following two conditions are met: 


_j An instruction writes to an auxiliary register or BK. 


Lj Thenextinstruction uses the same auxiliary register as an address pointer 
or index in indirect addressing mode, or uses BK in circular addressing 
mode. This instruction could also be an MVMM or a CMPR that reads BK 
or the same ARx. 


This conflict occurs because the first instruction updates ARx or BK in either 
the read or execute stage of the pipeline and the following instruction uses BK 
or the same ARx when itis in the access stage of the pipeline. This results in 
an incorrect ARx or BK read by the second instruction, because the previous 
instruction has not yet updated the register’s contents. 


Certain instructions (see Table 7-6) do not have any latency in updating ARx. 
Use these instructions wherever possible to avoid pipeline conflicts. 
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Table 7-6. Pipeline-Protected Instructions for Updating ARx 


To do this: Use this instruction: 
Write an immediate value to ARx STM #lk, MMRT 
Copy a memory location to ARx MVDK Smem, MMRt 
Copy the contents of an ARx to another ARx MVMM MMR, MMR 


T See Table 7-7 for one possible conflict with these instructions. 


STM and MVDK do not conflict with the next instruction for two reasons: 


[j They are two-word instructions. 


[J] They update ARx when the first instruction word is in the read stage of the 
pipeline. 


Table 7—7 shows the latencies between instructions that update and subse- 
quently use ARx. The second and third instructions must access the same 
auxiliary register or BK to cause a latency. Any instruction not mentioned in the 
table has no latency. 


Table 7-8 shows the latencies between instructions that update and subse- 
quently use BK. 


OS 
Note: 


You are responsible for rearranging instructions or inserting NOPs, if necessary, 


to accommodate latencies. 
| ee | 


Table 7—7. Latencies for Accessing ARx 


(a) Latencies based on third-instruction category 


Second Instruction$§ 


STM #Ik, auxreg 
Sil #Ik, auxreg 


MVDK 
MVMD 


MVKD 
MVDM 
MVPD 


POPM 
POPD 
DELAY 
LTD 
MVDD 


Smem, auxreg 
MMR, auxreg 


dmad, auxreg 
dmad, auxreg 
pmad, auxreg 


auxreg 
auxreg 
auxreg 
auxreg 
Xmem, auxreg 


Store-type instructions (see Table 7—5) 


(b) Categories for the third instruction 


Category | 
MVMM auxreg, MMR 
CMPR CC, auxreg 


MVKD dmad, auxind 
MVDM dmad, auxind 
MVPD pmad, auxind 
MACP auxind, pmad, src 
MACD auxind, pmad, src 


ADD _ auxind, shift, src, dst 
LD auxind, shift, dst 
STH - sre, shift, auxind 
STL _ sre, shift, auxind 
SUB — auxind, shift, src, dst 


BANZ[D] auxind = 
Instructions not listed here 


that use ARx in indirect 
addressing mode. 


With a long-offset 
modifier 


With an extended 
shift and a long- 
offset modifier 


Third Instruction 


Category | 
ot 


ot 


1t 


Category Il 


MVKD dmad, auxind 
MVDM dmad, auxind 
MVPD pmad, auxind 
MACP auxind, pmad, src 
MACD auxind, pmad, src 


ADD _ auxind, shift, src, dst 
LD auxind, shift, dst 
STH © sre, shift, auxind 
STL _ sre, shift, auxind 
SUB — auxind, shift, src, dst 


With or without along- FIRS 
offset modifier 


Pipeline Latencies 


Category Il 
0 


ot 


Without a long- 
offset modifier 


With an extended 
shift and without 
a long-offset 
modifier 


With one operand 
using indirect ad- 
dressing mode with 
or without a long- 
offset modifier 


Tt Add one more cycle of latency if the first instruction meets the DAGEN register conflict criteria. See section 7.5.3, Rules to Deter- 


mine DAGEN Register Access Conflicts, for more information. 


§ The destination operand auxreg must point at ARO-AR7 in either direct or indirect addressing mode. The operand auxind must 


use indirect addressing mode. 
{ Shift value between —16 and 15 


Notes: 


2) The first instruction can be any ’54x instruction. 


1) Any instruction that does not fit in either of the two categories has zero latency. 
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Table 7-8. Latencies for Accessing BK 


(a) Latencies based on third-instruction category 
Third Instruction 


Second Instruction§ Category Il 
STM 


#Ik, bkreg 1t ot 
ST #Ik, bkreg 


MVDK_ Smem, bkreg 1T ot 
MVMD_ MMR, bkreg 


MVKD_ dmad, bkreg 2 1 
MVDM_ dmad, bkreg 
MVPD pmad, bkreg 


POPM _bkreg at 1t 
POPD _ bkreg 

DELAY bkreg 

LTD bkreg 

MVDD Xmenm, bkreg 


Category | 


Store-type instructions (see Table 7—5) 3 2 


(b) Categories for the third instruction 
Category | Category Il 
MVKD dmad, circind 


MVKD dmaad, circind 
MVDM dmad, circind 
MVPD pmad, circind 
MACP circind, pmad, src 
MACD circind, pmad, src 


ADD _ circind, shift, src, dst 
LD circind, shift, dst 
STH - sre, shift, circind 
STL © sre, shift, circind 
SUB | circind, shift, src, dst 


BANZ[D] circind 7 
Instructions not listed here 

that use BK in circular 
addressing mode. 


With a long-offset 
modifier 


With an extended 
shiftl and a long- 
offset modifier 


MVDM dmad, circind 
MVPD pmad, circind 
MACP circind, pmad, src 
MACD circind, pmad, src 


ADD  circind, shift, src, dst 
LD circind, shift, dst 
STH © sre, shift, circind 
STL © sre, shift, circind 
SUB | circind, shift, src, dst 


With or without along- FIRS 
offset modifier 


Without a long- 
offset modifier 


With an extended 
shift and without 
a long-offset 
modifier 


With one operand 
using indirect ad- 
dressing mode with 
or without a long- 
offset modifier 


i; 


t Add one more cycle of latency if the first instruction meets the DAGEN register conflict criteria. See section 7.5.3, Rules to Deter- 
mine DAGEN Register Access Conflicts, for more information. 

§ The destination operand bkreg must point at BK in either direct or indirect addressing mode. The operand circind must use 
circular addressing mode. 


1 Shift value between —16 and 15 
Notes: 1) Any instruction that does not fit in either of the two categories has zero latency. 


2) The first instruction can be any ’54x instruction. 
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Pipeline Latencies 


(a) 
ADD A, B 
STM #100h, AR3 ; This instruction does not conflict 
; with the previous instruction. 
LD *AR3+, A ; No latency is required to use AR3. 
(b) 
ADD A, B ; This instruction does not create 
; a DAGEN conflict. 
MVDK 200h, AR7 ; This instruction has zero latency. 
STH B, *AR7+ 
(c) 
STLM A, ARI ; This instruction updates AR1 in 
; the execute stage, possibly 
7 creating a DAGEN conflict. 
MVDK * (200h) , AR2 ; However, this instruction uses a 
; long offset modifier. Therefore, 
; it creates no DAGEN conflict. 
MAR *AR2+ ; No latency is required to use AR2. 


Example 7-26. ARx Updated With a 1-Cycle Latency 


(a) 
ADD A, B ; This instruction does not create 
; a DAGEN conflict. 
POPM AR3 ; This instruction has a 1-cycle 
; latency if AR3 is used in the next 
NOP ; instruction. The NOP is inserted 
LD *AR3+, A ; to avoid the conflict. 
(b) 
STLM A, ARI ; This instruction updates AR1 in 
; the execute stage. 
POPM BK , This instruction tries to update 
; BK in the read stage. The CPU 
; delays the update by one cycle. 
STM #1, AR2 ; This instruction tries to update 
NOP ; AR2 in the read stage. The CPU 
; delays this update by one cycl 
LD *AR2+, B ; This is why one NOP is required. 


Pipeline 7-49 


Pipeline Latencies 


Example 7-27. ARx Updated With and Without a 1-Cycle Latency 


(a) ARx updated with a one-cycle latency 


STLM 


MVDK 
NOP 
MAR 


A, ARI 


100h, AR2 


*AR2+ 


’ 


This instruction creates DAGEN 
conflict. 

The AR2 update is delayed by one 
cycle. 


(bo) ARx updated with no latency after reordering instructions 


MVDK 


STLM 


MAR 


100h, AR2 


A, AR1 


*AR2+ 


This instruction does not require 
any latency. 

This instruction is placed after 
MVDK to avoid a DAGEN conflict. 
No latency is required now. 


Example 7-28. ARx Updated With and Without a 2-Cycle Latency 
(a) ARx updated with a two-cycle latency 


STLM 


POPM 
NOP 
NOP 
LD 


A, SP 


AR1 


*ARI+ 


et 


This instruction creates a DAGEN 
conflict. 

This instruction has a 2-cycle 
latency due to the DAGEN conflict. 


Two NOPs avoid this conflict. 


(b) ARx updated with no latency after reordering instructions 


POPM 


STLM 


LD 


AR1 


A, SP 


*ARI+ 


,A 


This instruction has a 1-cycle 
latency. 

This instruction is placed after 
the POPM to avoid DAGEN conflicts 
and to eliminate the need for 
NOPs. 


Example 7-29. ARx Updated With a 2-Cycle Latency 


ADDA, B 
STLM A, AR 
NOP 

NOP 

MVMM ARI, 


1 


AR2 


This instruction does not create a 
DAGEN conflict. 

This instruction has a 2-cycle 
latency. 
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Example 7-30. BK Updated With a 1-Cycle Latency 


ADD A, B ; This instruction does not create a 
STM #100h, BK ; DAGEN conflict. 
NOP ; This instruction needs a 1-cycl 
; latency. 
ADD *AR1+%, B ; This instruction uses BK for 
; circular addressing 


7.5.5 Latencies for the Stack Pointer 


Stack pointer (SP) latencies discussed in this section occur when SP is used 
in one of two ways: 


J As an offset in direct addressing (when CPL = 1) 
1 In apush, pop, call, return, FRAME, or MVMM operation 


7.5.5.1 SP Used in Compiler Mode (CPL = 1) 
A pipeline conflict occurs if two conditions are simultaneously met: 


[J] One instruction writes to SP. 


_j The next instruction uses SP as the base address for direct addressing in 
compiler mode (CPL = 1), or an interrupt occurs. 


The conflict occurs because the second instruction tries to use SP in a pipeline 
stage that occurs before the previous instruction updates it. 


Table 7-9 lists the latencies between instructions that update and subse- 
quently use SP in compiler mode. 


ES “waa 0.0.0 0 —_aoaauaaau00 oo —wa 
Note: 


You are responsible for rearranging instructions or inserting NOPs, if nec- 


essary, to accommodate for SP latencies. 
| | 


# Interrupts cause an update of SP. This update of SP can interfere with a previous write to SP. Therefore, special considerations 
must be made when using interrupts while executing instructions that update SP. 
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Table 7-9. Latencies for SP in Compiler Mode (CPL = 1) 


(a) Latencies based on third-instruction category 


Third Instruction 


Second Instruction Category | Category Il 
STM #lk, SP 1t ot 

ST #lk, SP 

MVDK Smem, SP 1T ot 
MVMD_ MMR, SP 

MVKD_ dmad, SP 2 1 


MVDM_ dmad, SP 
MVPD pmad, SP 


MVDD_ Xmen, spind at 1t 
POPM SP 
POPD SP 


FRAME k 1 0 
MVMM MMR, SP 

POPM MMR 

POPD Smem 

PSHM MMR 

PSHD Smem 

RETFD 


Store-type instructions (See Table 7-5) 3 2 


(b) Categories for the third instruction 


Category | 


All instructions using SP in 
direct addressing mode, 
except those listed in 
Category Il. 


Category Il 


MVKD dmad, dirmem 
MVDM dirmem, MMR 
MVPD pmad, dirmem 
MACP dirmem, pmad, src 


MACD dirmem, pmad, src 


ADD dirmem, shift, src, dst 
LD dirmem, shift, dst 

STH sre, shift, dirmem 
STL sre, shift, dirmem 
SUB dirmem, shift, src, dst 


Legend: SP Destination operand pointing to the stack pointer in either direct or indirect addressing modes 
MMR Any memory-mapped register except SP 
spind Destination operand pointing to the stack pointer using indirect addressing mode 
dirmem Operand using direct addressing mode in compiler mode (CPL = 1) 
t Add one more cycle of latency if the first instruction meets the DAGEN register conflict criteria. See section 7.5.3, 
Rules to Determine DAGEN Register Access Conflicts, for more information. 
+ Shift value between —16 and 15. 


Notes: 1) Any instruction that does not fit in either of the two categories has zero latency. 
2) The first instruction can be any ’54x instruction. 


With an extended shiftt 
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Example 7-31. SP Load With No Latency in Compiler Mode (CPL =1) 


(a) 
ADD A, B ; This instruction does not create a 
; DAGEN conflict. 
MVDK 100h, SP ; This SP update requires a zero 
; latency according to the above table. 
ADD 50h,-3,A,B 
(b) 
STLM A, AR2 ; This instruction does not affect 
; pipeline latency. 
POPM ARI ; This SP update requires a zero 


; latency according to the table. 
MVKD 100h, 1h 


Example 7-32. SP Load With a 1-Cycle Latency in Compiler Mode (CPL = 1) 


(a) 
STLM A, AR2 ; This instruction does not affect 
; pipeline latency. 
MVMM ARI, SP ; This SP update requires a one-cycle 
; latency since the next instruction 
NOP ; uses SP when CPL = 1. 
LD 50h,A 
(b) 
ADD A, B ; This instruction does not create a 
; DAGEN conflict 
STM #100h, SP ; This SP update requires a one-cycle 
; latency since the next instruction 
NOP ; uses SP when CPL = 1. 
LD 50h,A 
(c) 
ADD A,B ; This instruction does not affect 
; pipeline latency. 
RETFD ; SP is incremented after popping the 
; return address. 
NOP 
LD 50h, A ; This instruction cannot be placed in 


; the first delay slot since it uses 
; Girect addressing mode with the new 
; SP value. 
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Example 7-33. SP Load With and Without a 2-Cycle Latency 
(a) SP Load With a Two-Cycle Latency 


STLM A, BK 


MVDK 100h, SP 
NOP 

NOP 

LD 50h, A 


r 
r 
a 
r 


r 


This instruction creates a DAGEN 
conflict. 

This SP update requires 2 cycles 
of latency according to the above 
table. 


(b) SP Load With No Latency 


MVDK 100h, SP 
STLM A, BK 
LD 50h, A 


r 


This SP update requires 1 cycle 

of latency. 

This instruction is placed after the 
MVDK instruction to prevent a DAGEN 
conflict. 

No NOPs are required in this case. 


Example 7-34. SP Load With a 2-Cycle Latency in Compiler Mode (CPL = 1) 


ADD A, B 


POPM SP 

NOP 

NOP 

LD 50h, A 


This instruction does not affect 
pipeline latency. 

The new value of SP is popped from 
the stack. Two NOPs are required to 
use the new value of SP. 


Example 7-35. SP Load With a 3-Cycle Latency in Compiler Mode (CPL = 1, DP = 0) 


STLM A, AR1 
STH A, SP 
NOP 
NOP 
NOP 
LD 50h,A 


This instruction does not affect 
pipeline latency. 

This SP update requires a three-cycle 
latency since the next instruction 
uses SP when CPL is 1. 
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7.5.5.2 SP Used in Push, Pop, Call, Return, FRAME, and MVMM Operations 
A pipeline conflict occurs if two conditions are simultaneously met: 


1 An instruction updates SP. 


_j The next instruction uses the stack for a push, pop, call, return, FRAME, 
or MVMM operation. 


The conflict occurs because the second instruction tries to use SP in a pipeline 
stage that occurs before the stage in which the previous instruction updates 
SP. 


Table 7—10 lists instructions that do not have any latency in updating SP when 


the CPU is not in compiler mode (CPL = 0). These instructions should be used 
wherever possible to avoid conflicts. 


Table 7-10. Pipeline-Protected Instructions to Update SP in Noncompiler Mode (CPL = 0) 


To do this: Use this instruction: 
Write an immediate value to SP STM _ #lk, SPT 
Copy a memory location to SP MVDK Smem, SPT 
Copy the contents of an ARx or BK to SP MVMM MMR, SP 
Move SP by a frame FRAME k 


t See Table 7-11 for one possible conflict with these instructions. 


Table 7-11 lists the latencies between instructions that update and use SP in 
noncompiler mode (CPL = 0). 


Te | 


Note: 


You are responsible for rearranging instructions or inserting NOPs, if necessary, 
to accommodate SP latencies. 


eee sss) 
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Table 7-11. Latencies for SP in Noncompiler Mode (CPL = 0) 


(a) Latencies based on third-instruction category 


Third Instruction 


Second Instruction Category | Category Il 
STM #lk, SP ot 0 

ST #lk, SP 

MVDK Smem, SP ot 0 


MVMD MMR, SP 


MVKD_ dmad, SP 1 0 
MVDM_ dmad, SP 
MVPD pmad, SP 


MVDD_ Xmen, spind 1t ot 
POPM SP 

POPD SP 

Store-type instructions (See Table 7-5) 2 1 


(b) Categories for the third instruction 


Category | Category Il 

PSHM MMR PSHM MMR 

PSHD Smem Without a long-offset PSHD Smem With a long-offset 

POPM MMR modifier POPM MMR modifier 

PSHM Smem PSHM Smem 

CALL[D] address CALAI[D] address 

CC[D] address FCALA|D] 

FCALLI[D] 

FRET[D] 

FRETE[D] 

INTR k 

RC[D] 

RET[D] 

RETE[D] 

RETF[D] 

MVMM  SP,MMR 

FRAME k 

TRAP n 

Legend: SP Destination operand pointing to the stack pointer in either direct or indirect addressing modes 

MMR Any memory-mapped register except SP 
spind Destination operand pointing to the stack pointer using indirect addressing mode 


T Add one more cycle of latency if the first instruction meets the DAGEN register conflict criteria. See section 7.5.3, 
Rules to Determine DAGEN Register Access Conflicts, for more information. 


Notes: 1) Any instruction that does not fit in either of the two categories has zero latency. 


2) The first instruction can be any ’54x instruction. 
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Example 7-36. SP Load With No Latency in Noncompiler Mode (CPL = 0) 


(a) 
ADD A, B ; This instruction does not create 
; a DAGEN conflict 
STM #100h, SP ; This SP update does not require any 
; latency according to the above table. 
PSHM AR1L 
(b) 
STH A, 100h ; This instruction does not create 
; a DAGEN conflict. 
MVDK 200h, SP ; This SP update does not require any 
; latency according to the above table. 
FRAME 10 


Example 7-37. SP Load With and Without a 1-Cycle Latency in Noncompiler Mode 
(CPL = 0) 


(a) SP Load With a One-Cycle Latency 


STLM A, ARI ; This instruction causes a DAGEN 

, conflict with the next instruction. 
MVDK 200h, SP ; This SP update requires a one-cycle 
NOP ; latency. 
PSHM AR2 


(b) SP Load With No Latency 


MVDK 200h, SP ; This instruction requires no latency. 
STLM A, ARI , This instruction was placed after 
; MVDK to avoid a DAGEN conflict. 


PSHM AR2 


Example 7-38. SP Load With a 1-Cycle Latency in Noncompiler Mode (CPL = 0) 


STLM A, ARI ; This instruction does not affect the 
; pipeline latency for the next 
; instruction. 


STLM B, SP ; This SP update requires a one-cycle 
NOP ; latency. 
CALA A 
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7.5.6 Latencies for Temporary Register (T) 


A pipeline conflict can occur when accessing T if two conditions are simulta- 
neously met: 


(1 An instruction writes to T 
Lj The next instruction uses T for a shift or bit-test operation. 


The conflict occurs because the second instruction tries to use T in a pipeline 
stage that occurs before the previous instruction updates it. 


Table 7-12 lists instructions that do not have any latency in updating T. Use 
these instructions wherever possible to avoid any conflicts. 


Table 7-12. Pipeline-Protected Instructions for Updating T 


To do this: Use this instruction: 

Write an immediate value to T STM #lk, T 

Copy a memory location to T MVDK Smem, T 

Copy a memory location to T LD Smem, T 

Copy a memory location to T ST src, Ymem 
||LD = Xmem, T 


Table 7-13 lists the latencies between instructions that update and use T. 


 — 
Note: 


You are responsible for rearranging instructions or inserting NOPs, if neces- 


sary, to accommodate T latencies. 
a | 
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Table 7-13. Latencies for the T Register Based on Secona-Instruction Category 


(a) Latencies Based on Second-Instruction Category 


Second Instruction 


First Instruction Category | 
MVKD_ dmad, T 1 
MVDM_ dmad, T 

POPM T { 
POPD T 

DELAY T 

MVDD Xmem, Ting 

Store-type instructions (see Table 7—5) i 

EXP src 1 


(b) Categories for the Second Instruction 


Category | 

LD Smem, TS, dst 

ADD Smem, TS, src Without a long-offset modifier 
SUB Smem, TS, src 


NORM _ src, dst 
BITT Xmem 
DADST Lmenm, dst 
DSADT Lmem, dst 
DSUBT Lmen, dst 


Legend: T Destination operand pointing at T in either direct or indirect addressing 
modes. 
Ting Destination operand pointing at T using indirect addressing mode. 


Note: Any instruction that does not fit in Category | has zero latency. 
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Example 7-39. T Load With No Latency 


(a) 
LD *AR3+, T ; This T update does not require 
LD *AR5+,TS,A ; any latency. 

(b) 
STM #100h, T ; This T update does not require 
LD *AR5+,TS,A ; any latency. 


Example 7-40. T Load With a 1-Cycle Latency 


(a) 
POPM Tt ; This instruction requires a on 
NOP ; cycle latency. 
BELT *ARS+ 

(b) 
EXP A ; This instruction requires a one- 
NOP ; cycle latency. 
NORM A 
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7.5.7 Latencies for Accessing Status Registers 


The following status register fields and bits are affected by latency: 


ARP (auxiliary register pointer) 
CMPT (compatibility mode bit) 
CPL (compiler mode bit) 

DP (data page pointer) 

SXM (sign-extension mode bit) 
ASM (accumulator shift mode field) 
BRAF (block-repeat activity flag) 


UOUUOUCUU 


7.5.7.1. ST1 anda Repeat Block Loop 


Some instructions write to ST1 in the execute stage of the pipeline. If any of 
these instructions is immediately followed by a RPTB[D] instruction that sets 
the BRAF flag in ST1 an incomplete repeat-block loop results. This occurs 
because RPTB[D] sets BRAF in the access stage of the pipeline and the 
previous instruction overwrites ST1 one cycle later. 


To avoid this conflict, it is recommended that only instructions listed in 
Table 7-14 be used to write to ST1 immediately prior to a RPTB[D] instruction. 
Any other instruction that writes to ST1 must not be immediately followed by 
a RPTBID] instruction. 


Table 7-14. Recommended Instructions for Writing to ST1 


To do this Use this instruction 
Store a value to ST1 STM #k, ST1 
Sil #k, ST1 
Copy a value from data memory to ST1 MVDK_ #k, ST1 
MVMD_ #k, ST1 
Clear a bit in ST1 RSBX 
Set a bit in ST1 SSBX 
Load ASM with a value LD #k, ASM 


LD Smem, ASM 
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7.5.7.2 Updating ARP in Compatibility Mode (CMPT = 1) and CMPT bit 
A pipeline conflict can occur if two conditions are simultaneously met: 


1 An instruction updates ARP or CMPT. 


[1 The next instruction uses ARP or CMPT to update the address pointer in 
indirect addressing mode. 


The conflict occurs because the second instruction uses ARP or CMPT ina 
pipeline stage that occurs before the previous instruction updates ARP or 
CMPT. 


Table 7-15 lists one instruction that does not have any latency in updating 
ARP when the CPU is in compatibility mode. Use this instruction wherever 
possible to avoid any conflicts. 


Table 7-15. Pipeline-Protected Instruction to Update ARP in Compatibility Mode 


(CMPT = 1) 
To do this: Use this instruction: 
Load ARP field of STO register LD #k, ARP 


Table 7—16 lists the latencies between instructions that update and use ARP 
or CMPT. 


a | 


Notes: 


1) You are responsible for rearranging instructions or inserting NOPs, if 
necessary, to accommodate latencies. 


2) In compatibility mode (CMPT = 1), ARP is automatically updated by 
instructions that use indirect addressing mode. There is no pipeline 
conflict associated with such an ARP update. 


3) ARP must always be set to 0 when the DSP is in standard mode 
(CMPT = 0). At reset, both ARP and CMPT are set to 0 automatically. 


sss | 
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Table 7-16. Latencies for ARP in Compatibility Mode (CMPT = 1) and CMPT bit 


(a) Latencies based on second-instruction category 


Second Instruction 


First Instruction Category | Category Il 
STM #Ik, status 2 2 
Sill #\k, status 

MVDK_ Smem, status 2 2 
MVMD_ MMR, status 

MVKD_ dmad, status 3 2 
MVDM_ dmad, status 

MVPD _ pmad, status 

MVPD pmad, status 3 3 
POPM status 3 2 
POPD status 

DELAY status 

LTD status 

MVDD status 

Store-type instruction (see Table 7-5) 3 2 
SSBX _ statbit 3 2 
RSBX _ statbit 


(b) Categories for the second instruction 


Category | Category Il 

MVKD_ dmad, auxind MVKD dmad, auxind 

MVDM_ dmad, auxind With a long-offset MVDM dmad, auxind ; 

MVPD dmad, auxind modifier MVPD pmad, auxind Without a long- 
MACP_ dmad, auxind, pmad, src MACP auxind, pmad, src offset modifier 
MACD  auxind, pmad, src MACD auxind, pmad, src 

ADD auxind, shift, src, dst With an ADD _auxind, shift, src, dst With an 

LD auxind, shift, dst extended shiftt LD —auxind, shift, dst extended shiftt 
STH _ sre, shift, auxind and along offset STH — src, shift, auxind and without a 
STL arc, shift, auxind modifier STL © sre, shift, auxind long-offset 
SUB __auxind, shift, src, dst SUB — auxind, shift, src, dst modifier 


All other instructions that use ARP or 
CMPT in indirect addressing mode 
with or without a long offset modifier. 


Legend: 


Note: 


status Destination operand pointing to STO or ST1 to update ARP or CMPT respectively in either direct or 
indirect addressing modes 

MMR Any memory-mapped register 

auxind Aread or write operand using indirect addressing mode 

statbit Destination operand writing to a bit in ARP or CMPT 

T Shift value between —16 and 15. 

Any instruction that does not fit in either of the two categories has zero latency. 
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Example 7-41. ARP Load With No Latency in Compatibility Mode (CMPT = 1) 


LD #1h, ARP ; This ARP load does not require any 
; latency. 


LD *ARO, A 


Example 7-42. ARP Load With a 2-Cycle Latency in Compatibility Mode (CMPT = 1) 


STLM A, STO ; The ARP field of STO is updated here. 
NOP 

NOP 

ADD *ARO+,-3,B ; The new ARP value is used here 


Example 7-43. ARP Load With a 3-Cycle Latency in Compatibility Mode (CMPT = 1) 


POPM STO ; The ARP field of STO is updated here. 
NOP 

NOP 

NOP 

LD *ARO+, A ; The new ARP value is used here. 


7.5.7.3 Updating DP in Direct Addressing Mode (CPL = 0) 
A pipeline conflict can occur if two conditions are simultaneously met: 


_j An instruction updates DP. 


1 The next instruction uses DP as the base address for direct addressing in 
noncompiler mode (CPL = 0). 


The conflict occurs because the second instruction uses DP in apipeline stage 
that occurs before the previous instruction updates it. 


Table 7-17 lists instructions that do not have any latency in writing to DP. It is 
recommended that these instructions be used wherever possible to avoid 
conflicts. 


Table 7-17. Recommended Instructions to Update DP in Noncompiler Mode (CPL = 0) 


To do this: Use this instruction: 
Load an immediate number to DP LD #k, DP 
Copy contents of a memory location to DP LD Smem, DP 


Table 7—18 lists the latencies between instructions that update DP and subse- 

quently use it. 

aT, | 
Note: 


You are responsible for rearranging instructions or inserting NOPs, if neces- 


sary, to accommodate latencies. 
eee eee ee ee ee | 


7-64 


Pipeline Latencies 


Table 7-18. Latencies for DP in Noncompiler Mode (CPL = 0) 


(a) Latencies based on second-instruction category 


Second Instruction 


First Instruction Category | Category Il 
STM #lk, status 2 2 
ST #lk, status 

MVDK_ Smem, status 2 2 
MVMD MMR, status 

MVKD_ dmad, status 6} 2 
MVDM_ dmad, status 

MVPD pmad, status 3 3 
POPM status 3 2 
POPD status 

MVDD status 

Store-type instruction (see Table 7-5) 3 2 
SSBX STO, statbit 3 2 
RSBX STO, statbit 


(b) Categories for the second instruction 


Category | Category Il 

All instructions that use DP for MVKD_ dmad, dirmem 
direct addressing mode except MVPD pmad, dirmem 
those listed in Category Il. MACP_ dirmem, pmad, src 


Legend: 


MACD_ dirmem, pmad, src 


ADD dirmem, shift, src, dst 

LD dirmem, shift, dst With an extended 
STH _ src, shift, dirmem shiftt 

STL src, shift, dirmem 

SUB dirmem, shift, src, dst 


status Destination operand pointing to STO to update DP in either direct or indirect addressing modes 


Note: 


MMR Any memory-mapped register 

statbit Destination operand writing to a bit in DP field of STO 

dirmem A read or write operand using direct addressing mode when CPL = 0 
t Shift value between —16 and 15. 

Any instruction that does not fit in either of the two categories has zero latency. 
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Example 7-44. DP Load With No Latency in Noncompiler Mode (CPL = 0) 


(a) 
LD #2h, DP ; This DP load does not require any 
; latency. 
LD 27h, A 
(b) 
LD 100h, DP ; This DP load does not require any 
; latency. 
LD 27h, A 


Example 7-45. DP Load With a 2-Cycle Latency in Noncompiler Mode (CPL = 0) 


STLM A, STO ; The DP field of STO is updated here. 
NOP 

NOP 

STH B,-3,27h ; The new DP value is used here. 


Example 7-46. DP Load With a 3-Cycle Latency in Noncompiler Mode (CPL = 0) 


POPM STO ; The DP field of STO is updated here. 
NOP 

NOP 

NOP 

LD 27h, A ; The new DP value is used here 


7.5.7.4 Updating CPL 


A pipeline conflict can occur if two conditions are simultaneously met: 


The conflict occurs because the second instruction reads CPL in a pipeline 


[1 An instruction modifies CPL. 
Lj The next instruction uses direct addressing mode. 


stage that occurs before the previous instruction updates it. 


Table 7-19 lists the latencies between instructions that update CPL and sub- 


sequently use it. 


Note: 


You are responsible for rearranging instructions or inserting NOPs, if necessary, 


to accommodate latencies. 
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Table 7-19. Latencies for the CPL Bit 


(a) Latencies based on second-instruction category 


Second Instruction 


First Instruction Category | Category Il 
STM #lk, status 2 1 
ST #lk, status 

MVDK_ Smem, status 2 1 
MVMD MMR, status 

MVKD_ dmad, status 6} 2 
MVDM_ dmad, status 

MVPD pmad, status 3 3 
POPM status 3 2 
POPD status 

MVDD status 

Store-type instruction (see Table 7-5) 3 2 
SSBX CPL 3 2 
RSBX CPL 


(b) Categories for the second instruction 


Category | Category Il 

All instructions that use direct MVKD_ dmad, dirmem 
addressing mode except those MVPD pmad, dirmem 
listed in Category Il. MACP_ dirmem, pmad, src 


MACD_ dirmem, pmad, src 


ADD dirmem, shift, src, dst 
LD dirmem, shift, dst 
STH src, shift, dirmem 
STL arc, shift, dirmem 
SUB dirmem, shift, src, dst 


With an extended 
shiftT 


MMR Any memory-mapped register 


Legend: 


Note: 


dirmem A read or write operand using direct addressing mode when CPL = 0 
status Destination operand pointing to ST1 to modify CPL in either direct, indirect, or memory-mapped 
addressing mode 
t Shift value between —16 and 15. 
Any instruction that does not fit in either of the two categories has zero latency. 
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Example 7-47. CPL Update With a 1-Cycle Latency 


STM #k, ST1 ; This instruction modifies the CPL 
* bit of STL. 

NOP 

MVKD 1000h, 30h ; Data read from 1000h is written to 
; (SP + 30h) 


Example 7-48. CPL Update With a 2-Cycle Latency 


RSBX CPL ; CPL is changed from 1 to 0. 
NOP ; Two NOPs are required, since a LD 
NOP ; with an extended shift is being 
; used to access an operand. 
LD 27h, -1, A ; The operand is read from the 
; current data page. 


Example 7-49. CPL Update With a 3-Cycle Latency 


SSBX CPL ; CPL is changed from 0 to 1. 
NOP ; These NOPs can be replaced by 
NOP ; other instructions that do not use 
NOP ; direct addressing mode to access 
; operands. 
LD 27h, A ; The operand is read at an offset 
; of 27h from SP. 


7.5.7.5 Updating SXM 


A pipeline conflict can occur if two conditions are simultaneously met: 


L] An instruction modifies SXM. 
[1 The next instruction uses SXM to control sign extension. 


The conflict occurs because the second instruction uses SXM in a pipeline 
stage that occurs before the previous instruction updates it. 


Table 7-20 lists the latencies between instructions that update SXM and sub- 
sequently use it. 


Note: 


You are responsible for rearranging instructions or inserting NOPs, if necessary, 


to accommodate latencies. 
eee ees 
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Table 7-20. Latencies for the SXM Bit 


(a) Latencies based on second-instruction category 


First Instruction 


MVKD_ dmad, status 
MVDM_ dmad, status 
POPM status 
POPD _ status 
MVDD Xmen, status 


Pipeline Latencies 


Second Instruction 
Category | 


{ 


Store-type instruction (see Table 7—5) 1 


SSBX SXM 
RSBX SXM 


(b) Category for the second instruction 


Category | 


All instructions affected by the sign-extension mode bit, except those that require 
an Smem operand with a long offset modifier (for example, LD *+AR1 (100h), A) 


Legend: 


Note: 


status 


Destination operand pointing at ST1 to update SXM in either direct, 


indirect, or memory-mapped addressing mode 


Any instruction that does not fit in Category | has zero latency. 


Example 7-50. SXM Update With No Latency 


RSBX 


ADD 


SXM : 


’ 


*+AR1(100h),A 


This instruction modifies the SXM bit of 
ord; 


Example 7-51. SXM Update With a 1-Cycle Latency 


(a) 
RSBX SXM ; This SXM load requires one cycle of 
; latency. 
NOP 
LD *ARS+, A 
(b) 
POPM ST1 ; This instruction modifies the SXM bit of 
NOP ; STL. 
ADD *AR2+,A 
(c) 
STLM A, ST1; This instruction modifies the SXM bit of 
» ST. 
NOP 
SUB *AR2-,A 
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7.5.7.6 ASM Field Used for Shift Operations 


A pipeline conflict can occur if two conditions are simultaneously met: 


[1 An instruction modifies ASM. 
1 The next instruction uses ASM as the shift-count value. 


The conflict occurs because the second instruction reads ASM in a pipeline 
stage that occurs before the previous instruction updates it. 


Table 7-21 lists instructions that do not have any latency for writing to the ASM 
bit field. Use these instructions wherever possible to avoid any conflicts. 


Table 7-21. Pipeline-Protected Instructions for Writing to ASM 


To do this: Use this instruction: 
Load an immediate number to ASM LD #k, ASM 
Copy contents of a memory location to ASM LD Smem, ASM 


Table 7—22 lists the latencies between instructions that write to ASM and those 
that subsequently use it. 


a | 


Note: 


You are responsible for rearranging instructions or inserting NOPs, if necessary, 
to accommodate latencies. 


| 


Table 7-22. Latencies for ASM Bit Field 


Pipeline Latencies 


(a) Latencies based on second-instruction category 


First Instruction 


MVKD_ dmad, status 
MVDM_ dmad, status 
POPM status 
POPD status 
MVDD Xmen, status 


Store-type instruction (see Table 7-5) 


SSBX __ STI, asmbit 
RSBX _— STI, asmbit 


(b) Category for the second instruction 


Category | 

STH src, ASM, Smem 

STL src, ASM, Smem 

ST src, Ymem 

|| LD/ADD/SUB/MAC/MAS/MPY 
SACCD src, Smem, cond 


LD src, ASM, dst 
ADD src, ASM, dst 
SUB src, ASM, dst 


Legend: asmbit 


Second Instruction 
Category | 


{ 


L Without a long-offset modifier 


Destination operand writing to a bit in ASM field of ST1 


status Destination operand pointing at ST1 to update ASM in direct, indirect,or 
memory-mapped addressing mode 


Note: Any instruction that does not fit in either of the two categories has zero latency. 
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Example 7-52. ASM Update With No Latency 


(a) 
LD #6, ASM ; This instruction loads ASM with no 
; latency. 
STH A, ASM, *AR1+ 
(b) 
LD 100h, ASM ; This instruction loads ASM with no 
; latency 
ADD A,ASM,B 
(c) 


STLM A, ST1 ; This instruction 
; field of ST1l. No 
; Since STL uses a 
; modifier. 

STL A, ASM, *+AR5 (100h) 


modifies the ASM 
latency is needed 
long offset 


Example 7-53. ASM Update With a 1-Cycle Latency 


POPM ST1 ; This instruction 
NOP ; field of STI 
SUB A,ASM,B 


modifies the ASM 
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7.5.8 Latencies in Repeat-Block Loops 
The following status register fields and bits are affected by latency: 


11 BRC (block-repeat counter register) 
_j BRAF (block-repeat active flag) 


7.5.8.1 Updating Block-Repeat Counter (BRC) Register 
A pipeline conflict can occur if two conditions are simultaneously met: 


[1 An instruction writes to the BRC register 
_j The next instruction is an RPTB[D] 


The conflict occurs because the second instruction reads BRC in a pipeline 
stage that occurs before the previous instruction updates it. 


There are certain instructions which do not cause any pipeline conflicts when 
updating BRC. Use these instructions wherever possible to avoid conflicts. 


Table 7-23. Recommended Instructions for Writing to BRC Before an RPTB Loop 


To do this Use this instruction 
Write an immediate value to BRC STM #ik, BRC 
Copy a memory location to BRC MVDK Smem, BRC 


Table 7-24 lists latencies between instructions that update BRC and an 
RPTB[D] instruction. 


NE 
Notes: 


1) Donot place instructions that modify BRC in the delay slots of a RPTBD 
instruction. 


2) You are responsible for rearranging instructions or inserting NOPS, if 


necessary, to accommodate latencies. 
eee ee ee ee ee ee ee | 


Table 7-24. Latencies for Updating BRC Before an RPTB Loop 


Latency if Second Instruction 
First Instruction Is RPTB[D] 


MVDK Smem, BRC 0 
MVMD MMR, BRC 


STM ~~ #k, BRC 0 
ST #k, BRC 


All other instructions that modify BRC 1 
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Example 7-54. Loading BRC Before Executing a New Repeat-Block Loop 


(a) 
STM #1k,BRC ; There is no latency when BRC is 
RPTB endloop-1 ; loaded via STM before a new RPTB 
abi ; loop. 

endloop: 

(b) 
MVDK count,BRC ; There is no latency when BRC is 
RPTBD endloop-1 ; loaded using MVDK before a new 
avees ; RPTB loop 

endloop: 

(c) 
STLM A, BRC ; There is a 1 cycle latency when 
NOP 7 BRC is loaded using an STLM 
RPTB endloop-1 ; Instruction. 

endloop: 

(d) 
POPM BRC ; There is a 1 cycle latency when 
NOP 7 BRC is loaded using a POPM 
RPTBD endloop-1 ; instruction. 

endloop: 


In a repeat-block loop, BRC is decremented when the last instruction in the 
loop is in the decode stage of the pipeline. However, the SRCCD instruction 
writes the BRC’s contents in the execute stage of the pipeline. This can result 
in an incorrect BRC value written by the SRCCD instruction. The pipeline con- 
flict can be avoided by placing the SRCCD instruction at least three instruction 
words from the bottom of the loop, as shown in Example 7-55 and 
Example 7-56. 


Example 7-55. SRCCD Instruction With No Latency 


RPTB endloop-1 


SRCCD *AR3, ALEQ ; Placing the SRCCD instruction in 
; this position ensures that current 
; value of BRC will be written 
; to memory. 


ADD *AR1+,A 

SUB *AR2-,A 

STH A, *AR1+ 
endloop: 
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Example 7-56. SRCCD Instruction With a 3-Cycle Latency 


RPTB 
SRCCD 
NOP 
NOP 


NOP 
endloop: 


endloop-1 


*AR3, ALEQ 


;This ensures that current value of 
; BRC will be written to memory. 


a 


There is also a 5-to-6-cycle latency when writing anew value to BRC from with- 
inaRPTB loop. The latencies described in Table 7-25 are relevant only if BRC 
is modified while a RPTB loop is active. See Example 7-57 for details. 


Table 7-25. Latencies for Updating BRC From Within an RPTB Loop 


Instruction 


STM #lk, BRC 
ST #lk, BRC 
MVDK Smem, BRC 
MVMD MMR, BRC 


All other instructions that modify 


BRC 


Latency 


The next 5 instruction words must not contain 
the last instruction in the RPTB loop. 


The next 6 instruction words must not contain 
the last instruction in the RPTB loop. 


Example 7-57. Modifying BRC From Within an RPTB Loop 


RPTB 
XC 
MVDK 
LD 
ADD 
SUB 
LD 
MPYA 


STL 
endloop: 


endloop-1 


2, Condition 
5h, BRC 


*AR1+, 
*AR2-, 
*AR3+, 
*AR4+, 
A 

A, *AR3+ 


1p PP 


Ne Ne Ne Ne Ne Ne Ne 


If Condition is evaluated as 
true, write the new count value 
to BRC. 

These six instructions provide 
sufficient latency for the new 
BRC value to tak ffect befor 
the next iteration begins. 


7.5.8.2 Deactivating the Block-Repeat Active Flag (BRAF) 


The ’54x sets the block-repeat active flag (BRAF) to 1 to indicate that the 
repeat-block loop is active. BRAF is set or cleared in the decode stage of the 
first instruction of the repeat-block loop during the last loop iteration (BRC = 0). 
BRAF is tested by the device at the end of each loop iteration to determine 
whether the next prefetch will be from the top of the loop or not. 
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BRAF can be deactivated in software to terminate the repeat-block premature- 
ly. This, however, must be done early enough in the pipeline so that BRAF is 
cleared prior to the prefetch of the instruction at the top of the loop. Therefore, 
an instruction that clears BRAF (such as RSBX) must be placed at least six 
instructions words before the end of the repeat-block loop. This is shown in 
Example 7-58. 


Example 7-58. BRAF Deactivation 


RPTB endloop-1 
RSBX BRAF ; This ensures that the loop will 
NOP ; terminate after completing this 
NOP ; iteration. 
NOP 
NOP ; These six NOPs may be replaced by 
NOP » other instructions. 
NOP 

endloop: 


7.5.9 Latencies for the PMST 
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PMST fields OVLY, DROM, MP/MC, and IPTR configure the 54x memory 
space. When an instruction modifies one of these fields, a certain number of 
pipeline cycles must pass before the new memory space can be accessed. 
This latency is required because the PMST fields are updated in the read or 
execute stage of the pipeline while instructions in earlier pipeline stages may 
be accessing that memory space. In the case of program memory control via 
OVLY, IPTR, and MP/MC fields, the prefetch stage of the pipeline evaluates 
these bits. In the case of data memory control via the DROM field, the access 
or read stage evaluates this bit. 


If an external memory access occurs, additional cycles are required for opera- 
tions affected by the changing bit. Any change in PMST fields is delayed until 
an in-process external access is completed. For example, if an external 
memory access is occurring while an instruction in the execute stage of the 
pipeline tries to modify OVLY, the update is delayed until the external bus cycle 
is completed. This, in turn, requires additional cycles to those listed in the 
following tables. 


Table 7-26 lists the latencies between instructions that write to the OVLY, 
IPTR, or MP/MC bit fields and those instructions that are subsequently fetched 
from the new memory space. 


NS 
Note: 


You are responsible for rearranging instructions or inserting NOPs, if necessary, 


to accommodate latencies. 
eee eee EEN EEE ggg 


Pipeline Latencies 


Table 7-26. Latencies for OVLY, IPTR, and MP/MC Bits 


(a) Latencies based on second-instruction category 


Second Instruction 


First Instruction Category! Category Il Category Ill Category IV 
STM #lk, pmst 0 0 1 2 
ST #lk, pmst 


MVDK dmad, pmst 
MVMD MMR, pmst 


All other instruc- 0 1 2 3 
tions that modify 

OVLY, IPTR, 

MP/MC 


(b) Categories for the second instruction 


Category | Category Il Category Ill Category IV 
BACC[D] BC[D] B[D] INTR 
CALA[D] CCID] BANZ[D] RETF[D] 
FRET[D] RC[D] CALL[D] TRAP 
FRETE[D] RET[D] FB[D] 

FBACC[D] RETE[D] FCALL[D] 

FCALAID] 


Legend: pmst Destination operand pointing at PMST to modify OVLY, IPTR, MP/MC in 
either direct, indirect, or memory-mapped addressing modes 


Notes: 1. Additional latency cycles are required if an external memory access is in 
progress when an instruction is trying to modify OVLY, IPTR, or MP/MC bit fields. 
2. The second instruction loads PC with a new value that points to the modified 
program address range. 


Example 7-59. OVLY Setup Followed by an Unconditional Branch (DP = 0) 


ORM #20h, PMST ; This instruction sets OVLY to 1. 
NOP 
NOP 
B onchip ; Branch to on-chip dual-access 
7 memory. 


Example 7-60. OVLY Setup Followed by a Conditional Branch 


MVDK 5h, PMST ; This instruction sets OVLY to 1. 
BC onchip, AEQ ; Branch to on-chip dual-access 
; memory. 
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Example 7-61. OVLY Setup Followed by a Return (DP = 0) 


ANDM #0ffdfh, PMST ; This instruction sets OVLY to 0. 
NOP 
RET ; Return to off-chip memory. 


Example 7-62. MP/MC Setup Followed by an Unconditional Delayed Call 


STLM A, PMST ;This instruction sets MP/MC to 1. 
NOP 
NOP 
CALLD offchip ; Call a routine in external 

; program memory after executing 
STM #k, ARI ; this 2-word instruction. 


Example 7-63. IPTR Setup Followed by a Software Trap 


STM #k, PMST 


This instruction relocates 


NOP interrupt vectors by writing to 
NOP the IPTR bit field. 
TRAP Fetch a TRAP vector from the 


Ne Ne Ne Ne Ne 


relocated vector table. 


Table 7-27 lists the latencies between instructions that write to the DROM bit 

of PMST and those that subsequently read from or write to the DROM address 
range. 

(aca | 
Note: 


You are is responsible for rearranging instructions or inserting NOPs, if neces- 


sary, to accommodate latencies. 
yyy yh os— ooo 
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Table 7-27. Latencies for the DROM Bit 


(a) Latencies based on second-instruction category 
And second instruction is 
First instruction is... Category I, the latency is... 


STM #lk, drom 2 
ST #lk, drom 

MVDK_ dmad, drom 

MVMD_ MMR, drom 


All other instructions that modify 3 
DROM 


(b) Catagory for the secona-instruction 
Category | 
All instructions that read from or write to the DROM address range 


Legend: drom Destination operand pointing at PMST to modify DROM bit in either direct , 
indirect, or memory-mapped addressing modes 


Notes: 1. Additional latency cycles are required if an external memory access is occurring 
at the time when an instruction is trying to modify the DROM bit field. 
2. Any instruction not listed in this table that modifies DROM bit of PMST register 
has zero latency. 


Example 7-64. DROM Setup Followed by a Read Access (DP = 0) 


ORM #8h, PMST ; This instruction sets DROM = 1. 
NOP 

NOP 

NOP 

LD *AR3, A ; Reads from on-chip DROM 


Example 7-65. DROM Setup Followed by a Dual-Read Access 


STM #k, PMST ; This instruction sets DROM = 1. 
NOP 
NOP 
MPY *AR3+, *AR4+, A ; This instruction reads from 
7 on-chip DROM. 
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7.5.10 Latencies for Memory-Mapped Accesses to Accumulators 


Accumulators A and B can be addressed as memory-mapped registers using 
memory-mapped, direct, or indirect addressing modes. Generally, itis not use- 
ful to access accumulators as memory-mapped registers, because the 54x 
instruction set supports direct accesses to the accumulators. Some examples 
of the instructions that support direct access to accumulators are ADD, SUB, 


AND, OR, XOR, LD, STH, STL, MAC, and MPYA. 


When the two accumulators are accessed using instructions that do not 
access them as memory-mapped registers, no pipeline latencies occur. In rare 
cases when you access the accumulators through the memory-mapped regis- 
ters AG, AH, AL, BG, BH, and BL, you can use any instruction that uses 
memory-mapped, direct, or indirect addressing modes to access operands. 
Examples of such instructions are POPM AL and PSHM AH. Note that DP 
must be zero in order to access memory-mapped registers via direct addres- 


sing modes. 


A pipeline conflict can occur when two conditions are simultaneously met: 


register. 


[J One instruction modifies an accumulator (either A or B) directly. 


(1 The next instruction tries to read that accumulator as a memory-mapped 


The conflict occurs because the first instruction updates an accumulator at the 
same time when the next instruction tries to read it as a memory-mapped 


register. 


Example 7-66. Accumulator Access With a 1-Cycle Latency 


ADD Smem, A 
instruction. 
NOP 


A is updated directly by this 


This conflict occurs because the 
next instruction tries to read A 


PSHM AL 


, 
i 
i 
; 
; aS a memory-mapped register. A 
i 
; 
; 


one-cycle latency required. 
This instruction reads A asa 
memory-mapped register. 
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Example 7-67. Accumulator Access With No Conflict 
(a) 


ADD Smem, A ; A is updated directly by this 
; instruction. No conflict occurs 
; because the next instruction reads 
7 accumulator A directly. 

NEG A ; This instruction reads A directly. 


(b) 


STLM A, BH ; BH is written using memory-mapped 
; addressing here. 
; No conflict occurs because the 
; next instruction also accesses the 
; Same accumulator as a memory- 
7 mapped register. 

PSHM BH ; Reads BH as a memory-mapped 
; register. 


(c) 


STLM A, BH ; BH is written using memory-mapped 
; addressing here. 
; No conflict occurs because the 
; next instruction accesses the same 
7 accumulator directly. 

NEG B ; This instruction reads B directly. 


Table 7—28 lists the latencies between instructions that update an accumulator 
directly and instructions that access the same accumulator as a memory- 
mapped register. 


TT, | 
Note: 


You are responsible for rearranging instructions or inserting NOPs, if necessary, 


to accommodate latencies. 
ae ee EEE EE EEE EEE EE EE EEE. = _aaaar| 
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Table 7-28. Latencies for Accumulators A and B When Used as Memory-Mapped 


Registers 


(a) Latencies based on second-instruction category 


Second Instruction 


Second Instruction 


First Instruction Category | Category Il 
All 1-word instructions that directly modify A or 1 0 
B without accessing them as memory-mapped 
registers 
ADD —_Smenm, shiftT, src, dst 1 0 
LD Smem, shiftt, dst 
SUB Smem, shift, src, dst 
All 2-word instructions that directly modify A or 0 0 
B without accessing them as memory-mapped 
registers 
(b) Categories for the second instruction 
Category | Category II 
All instructions that read an All instructions that read an ac- 
accumulator as a memory- cumulator as memory-mapped 
mapped register (AG, AH, AL, register (AG, AH, AL, BG, BH, 
BG, BH, and BL) without using BL) using a long-offset modifier 
a long-offset modifier. 
MVKD- accum, Smem 
MVDM_ accum, MMR 
ADD accum, shift, src, dst : : 
LD accum, shift, dst } fe a all ai au 
SUB accum, shift, src, dst 
Legend: MMR Any memory-mapped register 
accum Source operand pointing to AG, AH, AL, BG, BH, or BL using memory-mapped, direct, or 
indirect addressing modes 
Tt Shift value between —16 and 15. 


Example 7-68. Updating Accumulator With a 1-Cycle Latency 


LD Smem, 
NOP 
PSHM BH 


=i, 


B 


B is updated directly by this 
instruction 

Conflict occurs because next 
instruction tries to read B as 
a memory-mapped register. A 
one-cycle latency is required. 
Reads BH as a memory-mapped 
register. 


Ne Ne Ne Ne Ne Ne Ne Ne 
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Example 7-69. Updating Accumulator With No Latency 


(a) 
MAC #K,A ; A is updated directly by this 
; instruction. No latency is 
; required since MAC is a 2-word 
; instruction. 
PSHM AL ; This instruction reads A as a 
7 memory-mapped register. 
(b) 


ADD Smem, A , A is updated directly by this 
; instruction. No latency is 
; required since the next 
; instruction uses a long offset 
; modifier. 

LD * (AL) ,ASM ; This instruction reads A as a 
; memory-mapped register. 
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Chapter 8 


On-Chip Peripherals 


On-chip peripherals for the ‘54x are specific to the individual device. This chap- 
ter, along with Chapter 9, Serial Ports, and Chapter 10, External Bus Opera- 
tion, describes all of the available on-chip peripherals; however, your device 
may contain only a subset of them. 


Enhanced peripherals, available on specific ‘54x devices, are not discussed 
at length in this chapter. For detailed information about enhanced peripherals, 
see volume 5 of this reference set: TMS320C54x DSP, Enhanced Peripherals, 
(literature number SPRU302). 


All 54x devices have general-purpose I/O pins, a timer, a clock generator, a 
software-programmable wait-state generator, and a programmable bank- 
switching module. Different types of serial ports, host port interfaces, and clock 
generators are device-specific peripherals. The serial ports are discussed in 
Chapter 9, Serial Ports, and the software-programmable wait-state generator 
and programmable bank-switching module are discussed in Chapter 10, Ex- 
ternal Bus Operation. 


Topic Page 
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Available On-Chip Peripherals / Peripheral Memory-Mapped Registers 


8.1 Available On-Chip Peripherals 


The following on-chip peripherals are available on ’54x devices: 


CV General-purpose I/O pins: XF and BIO 
Lj Timer 
41 Clock generator 
Lj) Host port interface 
M 8-bit standard ('542, ’545, ’548, ’549) 
M 8-bit enhanced (’5402, 5410 — see note below) 
mM 16-bit enhanced (’5420 — see note below) 
[1 Synchronous serial port (541, 545, and ’546) 
[1 Buffered serial port (542, 543, 545, ’546, 548, and 549) 
[1 Multichannel buffered serial port (McBSP) (’5402, 5410, and ’5420 — see 
note below) 
iz 
i 
iz 


Time-division multiplexed (TDM) serial port (542, 543, 548, and ’549). 
Software-programmable wait-state generator 
Programmable bank-switching module 


ae | 
Note: Enhanced Peripherals 


For more detailed information on the enhanced peripherals, see volume 5 
of this reference set: TMS320C54x DSP, Enhanced Peripherals, literature 
number SPRU302. 


8.2 Peripheral Memory-Mapped Registers 
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Peripherals are operated and controlled by accessing memory-mapped con- 
trol and data registers. These registers can also transfer data to and from the 
peripherals. Setting and clearing bits in the control registers can enable, dis- 
able, initialize, and dynamically reconfigure the peripherals. The operations of 
the serial ports and the timer are synchronized to the CPU through interrupts 
or interrupt polling. When peripherals are not in use, the internal clocks are 
shut off; thus, the peripherals consume less power in normal run mode or in 
idle mode. 


The peripheral registers are mapped into data page 0. Table 8—1 through 
Table 8-7 list the individual peripheral memory-mapped registers for each ver- 
sion of the ’54x. 


Peripheral Memory-Mapped Registers 


Table 8-1. '541/541B Peripheral Memory-Mapped Registers 


Address 


(Hex) 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 

2A-2F 
30 
31 
32 

33-57 
58 

59-57 


Name 
DRRO 


DXRO 
SPCO 


Description 

Serial port 0 data receive register 
Serial port 0 data transmit register 
Serial port 0 control register 
Reserved 

Timer register 

Timer period register 

Timer control register 

Reserved 

Software wait-state register 
Bank-switching control register 
Reserved 

Serial port 1 data receive register 
Serial port 1 data transmit register 
Serial port 1 control register 
Reserved 

Clock mode register ('541B only) 


Reserved 
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Table 8-2. 542 Peripheral Memory-Mapped Registers 


Address 
(Hex) 


20 
21 
22 
23 
24 
25 
26 
27 
28 
29 

2A-2B 
2C 

2D-2F 
30 
31 
32 
33 
34 
35 

36-37 
38 
39 
3A 
3B 

3C-5F 


Name 
BDRRO 


BDXRO 
BSPCO 
BSPCEO 
TIM 
PRD 


Description 


Buffered serial port data receive register 
Buffered serial port data transmit register 
Buffered serial port control register 
Buffered serial port control extension register 
Timer register 

Timer period register 

Timer control register 

Reserved 

Software wait-state register 
Bank-switching control register 
Reserved 

Host port interface control register 
Reserved 

TDM serial port data receive register 
TDM serial port data transmit register 
TDM serial port control register 

TDM serial port channel select register 
TDM serial port receive transmit register 
TDM serial port receive address register 
Reserved 

ABU transmit address register 

ABU transmit buffer-size register 

ABU receive address register 

ABU receive buffer-size register 


Reserved 


Peripheral Memory-Mapped Registers 


Table 8-3. ‘543 Peripheral Memory-Mapped Registers 


Address 
(Hex) 


20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
2A-2F 
30 
31 
32 
33 
34 
35 
36-37 
38 
39 
3A 
3B 
3C-5F 


Name 
BDRRO 


BDXRO 
BSPCO 
BSPCEO 
TIM 
PRD 


Description 


Buffered serial port data receive register 
Buffered serial port data transmit register 


Buffered serial port control register 


Buffered serial port control extension register 


Timer register 

Timer period register 

Timer control register 

Reserved 

Software wait-state register 
Bank-switching control register 
Reserved 

TDM serial port data receive register 
TDM serial port data transmit register 
TDM serial port control register 

TDM serial port channel select register 
TDM serial port receive transmit register 
TDM serial port receive address register 
Reserved 

ABU transmit address register 

ABU transmit buffer-size register 

ABU receive address register 

ABU receive buffer-size register 


Reserved 
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Table 8-4. ‘545/'545A Peripheral Memory-Mapped Registers 


Address 
(Hex) 


20 
21 
OP 
23 
24 
25 
26 
27 
28 
29 
2A-2B 
2C 
2D-2F 
30 
31 
32 
33-37 
38 
39 
3A 
3B 
3C-57 
58 
59-5F 


Name 
BDRRO 


BDXRO 
BSPCO 
BSPCEO 
TIM 
PRD 


Description 

Buffered serial port data receive register 
Buffered serial port data transmit register 
Buffered serial port control register 
Buffered serial port control extension register 
Timer register 

Timer period register 

Timer control register 

Reserved 

Software wait-state register 
Bank-switching control register 
Reserved 

Host port interface control register 
Reserved 

Serial port data receive register 

Serial port data transmit register 

Serial port control register 

Reserved 

ABU transmit address register 

ABU transmit buffer-size register 

ABU receive address register 

ABU receive buffer-size register 
Reserved 

Clock mode register (545A only) 


Reserved 


Peripheral Memory-Mapped Registers 


Table 8-5. '546/546A Peripheral Memory-Mapped Registers 


Address 
(Hex) Name Description 


20 BDRRO Buffered serial port data receive register 
21 BDXRO Buffered serial port data transmit register 


22 BSPCO Buffered serial port control register 


23 BSPCEO Buffered serial port control extension register 
24 TIM Timer register 

25 PRD Timer period register 

26 TCR Timer control register 

27 === Reserved 


28 SWWSR __ Software wait-state register 


29 BSCR Bank-switching control register 


QAR SSS Reserved 
30 DRR1 Serial port data receive register 
31 DXR1 Serial port data transmit register 
32 SPC1 Serial port control register 
CER SS Reserved 


38 AXRO ABU transmit address register 

39 BKXO ABU transmit buffer-size register 

3A ARRO ABU receive address register 

3B BKRO ABU receive buffer-size register 
30-57) --- Reserved 

58 CLKMD ~~ Clock mode register (546A only) 
59-5F = —---— Reserved 
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Table 8-6. 548 Peripheral Memory-Mapped Registers 


Address 
(Hex) 


20 
21 
22 
23 
24 
25 
26 
27 
28 
29 

2A-2B 
2C 

2D-2F 
30 
31 
32 
33 
34 
35 

36-37 
38 
39 
3A 


3B 


Name 


BDRRO 
BDXRO 
BSPCO 
BSPCEO 
TIM 


PRD 


Description 


Buffered serial port 0 data receive register 
Buffered serial port 0 data transmit register 
Buffered serial port 0 control register 
Buffered serial port 0 control extension register 
Timer register 

Timer period register 

Timer control register 

Reserved 

Software wait-state register 
Bank-switching control register 

Reserved 

Host port interface control register 
Reserved 

TDM serial port data receive register 
TDM serial port data transmit register 
TDM serial port control register 

TDM serial port channel select register 
TDM serial port receive transmit register 
TDM serial port receive address register 
Reserved 

ABU 0 transmit address register 

ABU 0 transmit buffer-size register 

ABU 0 receive address register 


ABU 0 receive buffer-size register 


Peripheral Memory-Mapped Registers 


Table 8-6 °’548 Peripheral Memory-Mapped Registers (Continued) 


Address 
(Hex) 


3C 
3D 
3E 
3F 
40 
i 
42 
43 

44-57 
58 

59-5F 


Name 
AXR1 


BKX1 
ARR1 
BKR1 
BDRR1 
BDXR1 
BSPC1 
BSPCE1 


Description 

ABU 1 transmit address register 

ABU 1 transmit buffer-size register 

ABU 1 receive address register 

ABU 1 receive buffer-size register 
Buffered serial port 1 data receive register 
Buffered serial port 1 data transmit register 
Buffered serial port 1 control register 
Buffered serial port 1 control extension register 
Reserved 

Clock-mode register 


Reserved 


Table 8-7. ‘549 Peripheral Memory-Mapped Registers 


Address 
(Hex) 


20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
2A 
2B 


Name 


BDRRO 
BDXRO 
BSPCO 
BSPCEO 
TIM 
PRD 


Description 

Buffered serial port 0 data receive register 
Buffered serial port 0 data transmit register 
Buffered serial port 0 control register 

Buffered serial port 0 control extension register 
Timer count register 

Timer period register 

Timer control register 

Reserved 

External interface software wait-state register 
External interface bank-switching control register 
Reserved 


Extended software wait-state register 
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Table 8-7 °549 Peripheral Memory-Mapped Registers (Continued) 
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Address 
(Hex) 


BC 
2D-2F 
30 
31 
32 
33 
34 
35 
36-37 
38 
39 
3A 
3B 
3C 
3D 
3E 
3F 
40 
A 
42 
43 
44-57 
58 
59-5F 


Name 
HPIC 


AXRO 
BKXO 
ARRO 
BKRO 
AXR1 
BKX1 
ARR1 
BKR1 
BDRR1 
BDXR1 
BSPC1 
BSPCE1 


Description 


Host port interface control register 
Reserved 

TDM serial port data receive register 
TDM serial port data transmit register 
TDM serial port control register 

TDM serial port channel select register 
TDM serial port receive transmit register 
TDM serial port receive address register 
Reserved 

ABU 0 transmit address register 

ABU 0 transmit buffer-size register 

ABU 0 receive address register 

ABU 0 receive buffer-size register 

ABU 1 transmit address register 

ABU 1 transmit buffer-size register 

ABU 1 receive address register 

ABU 1 receive buffer-size register 
Buffered serial port 1 data receive register 
Buffered serial port 1 data transmit register 
Buffered serial port 1 control register 
Buffered serial port 1 control extension register 
Reserved 

Clock-mode register 


Reserved 


Peripheral Memory-Mapped Registers 


Table 8-8. ‘5402 Peripheral Memory-Mapped Registers 


Address 
(Hex) 


20 
21 

22 
23 
24 
25 
26 
27 
28 
29 
2A 
2B 
2C 

2D-2F 

30 
31 

32 


33-37 


38 


39 


3A-3B 
3C 


3D 


GPIOCR 


GPIOSR 


Description 


McBSP0 data receive register 2 
McBSP0 data receive register 1 
McBSP0 data transmit register 2 
McBSPO0 data transmit register 1 
TimerO register 

TimerO period counter 

Timer0O control register 
Reserved 

Software wait-state register 
Bank-switching control register 
Reserved 

Software wait-state control register 
HPI control register 

Reserved 

Timer1 register 

Timer1 period register 

Timer1 control register 
Reserved 


McBSP0O serial port sub-bank address register (See 
Table 8-11 on page 8-17.) 


McBSP0O serial port sub-bank data register (See 
Table 8-11 on page 8-17.) 


Reserved 
General purpose 1/O pins control register 


General purpose |/O pins status register 
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Table 8-8. 5402 Peripheral Memory-Mapped Registers (Continued) 


Address 
(Hex) 


3E-3F 
40 
A 
42 
43 

44-47 


48 


49 


4A-53 
54 


55 


56 


57 


58 
59-5F 


8-12 


Name 


DMPREC 


DMSA 


DMSDI 


DMSDN 


CLKMD 


Description 


Reserved 

McBSP1 data receive register 2 

McBSP1 data receive register 1 

McBSP1 data transmit register 2 
McBSP1 data transmit register 1 
Reserved 


McBSP1 serial port sub-bank address register (See 
Table 8—11 on page 8-17.) 


McBSP1 serial port sub-bank data register (See 
Table 8-11 on page 8-17.) 


Reserved 
DMA channel priority and enable control register 


DMA sub-bank address register (See Table 8-12 on 
page 8-18.) 


DMA sub-bank data register with sub-bank address 
auto-increment (See Table 8-12 on page 8-18.) 


DMA sub-bank data register (See Table 8-12 on page 
8-18.) 


Clock mode register 


Reserved 


Peripheral Memory-Mapped Registers 


Table 8-9. ‘5410 Peripheral Memory-Mapped Registers 


Address 
(Hex) Name Description 
20 DRR20 McBSP0 data receive register 2 
21 DRR10 McBSP0 data receive register 1 
22 DXR20 McBSP0 data transmit register 2 
23 DXR10 McBSPO0 data transmit register 1 
24 TIM Timer register 
25 PRD Timer period counter 
26 TCR Timer control register 
27 === Reserved 
28 SWWSR __ Software wait-state register 
29 BSCR Bank-switching control register 
2A SSS Reserved 
2B SWCR Software wait-state control register 
2C HPIC HPI control register 
2D-2F am Reserved 
30 DRR22 McBSP2 data receive register 2 
31 DRR12 McBSP2 data receive register 1 
32 DXR22 McBSP2 data transmit register 2 
33 DXR12 McBSP2 data transmit register 1 
34 SPSA2 Eee aoe address register (See 
35 SPSD2 aa eae is data register (See 
36-37 ==> Reserved 
3A-3F =o Reserved 
38 SPSAO eae pps ar al address register (See 
39 SPSDO McBSPO serial port sub-bank data register (See 


Table 8-11 on page 8-17.) 
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Table 8-9. 5410 Peripheral Memory-Mapped Registers (Continued) 


Address 
(Hex) 


40 
i 
42 
43 

44-47 


48 


49 


4A-53 
54 
55 


56 


57T 


58 


59 — 5F 
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Name 
DRR21 


DRR11 
DXR21 
DXR11 


DMPREC 
DMSA 


DMSDI 


DMSDN 


CLKMD 


Description 


McBSP1 data receive register 2 
McBSP1 data receive register 1 

McBSP1 data transmit register 2 
McBSP1 data transmit register 1 
Reserved 


McBSP1 serial port sub-bank address register (See 
Table 8—11 on page 8-17.) 


McBSP1 serial port sub-bank data register (See 
Table 8-11 on page 8-17.) 


Reserved 
DMA channel priority and enable control register 
DMA sub-bank-address register 


DMA sub-bank data register with sub-bank address 
auto-increment (See Table 8-12 on page 8-18.) 


DMA sub-bank data register (See Table 8-12 on page 
8-18.) 


Clock mode register 


Reserved 


Peripheral Memory-Mapped Registers 


Table 8-10. 5420 Peripheral Memory-Mapped Registers For Each DSP Subsystem 


Address 
(Hex) Name Description 
20 DRR20 MCBSP 0 data receive register 2 
21 DRR10 MCBSP 0 data receive register 1 
22 DXR20 MCBSP 0 data transmit register 2 
23 DXR10 MCBSP 0 data transmit register 1 
24 TIM Timer register 
25 PRD Timer period counter 
26 TCR Timer control register 
27 i Reserved 
28 SWWSR __ Software wait-state register 
29 BSCR Bank switching control register 
2A === Reserved 
2B SWCR Software wait-state control register 
2C HPIC HPI control register 
2D-2F =—— Reserved 
30 DRR22 MCBSP 2 data receive register 2 
31 DRR12 MCBSP 2 data receive register 1 
32 DXR22 MCBSP 2 data transmit register 2 
33 DXR12 MCBSP 2 data transmit register 1 
34 SPSA2 MCBSP 2 serial port sub-bank address register (See 
Table 8-11 on page 8-17.) 
35 SPSD2 MCBSP 2 serial port sub-bank data register (See 
Table 8-11 on page 8-17.) 
36-37 === Reserved 
38 SPSAO MCBSP 0 serial port sub-bank address register (See 
Table 8-11 on page 8-17.) 
39 SPSDO MCBSP 0 serial port sub-bank data register (See 


Table 8-11 on page 8-17.) 
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Table 8-10. ‘5420 Peripheral Memory-Mapped Registers For Each DSP Subsystem 


(Continued) 


Address 
(Hex) 


3A-3B 
3C 

3D-3F 
40 
At 
42 
43 

44-47 
48 


49 


4A-53 
54 
55 


56 
57 


58 
59-5F 
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DMPREC 
DMSA 


DMSDI 


DMSDN 


CLKMD 


Description 

Reserved 

General purpose I/O register 
Reserved 

MCBSP 1 data receive register 2 
MCBSP 1 data receive register 1 
MCBSP 1 data transmit register 2 
MCBSP 1 data transmit register 1 
Reserved 


MCBSP 1 serial port sub-bank address register (See 
Table 8-11 on page 8-17.) 


MCBSP 1 serial port sub-bank data register (See 
Table 8-11 on page 8-17.) 


Reserved 
DMA channel priority and enable control register 


DMA sub-bank address register (See Table 8—12 on 
page 8-18.) 


DMA sub-bank data register with sub-bank address 
auto-increment (See Table 8-12 on page 8-18.) 


DMA sub-bank data register (See Table 8—12 on page 
8-18.) 


Clock mode register 


Reserved 


Peripheral Memory-Mapped Registers 


Table 8-11. °5402/5410/5420 McBSP Sub-addressed Registers 


McBSPO McBSP1 McBSP2 Sub- 
Address Address Address address 

Name (Hex) Name (Hex) Name (Hex) (Hex) Description 

SPCR10 39 SPCR11 49 SPCR12 35 00 Serial port control register 1 

SPCR20 39 SPCR21 49 SPCR22 35 01 Serial port control register 2 

RCR10 39 RCR11 49 RCR12 35 02 Receive control register 1 

RCR20 39 RCR21 49 RCR22 35 03 Receive control register 2 

XCR10 ts) XCR11 49 XCR12 35 04 Transmit control register 1 

XCR20 39 XCR21 49 XCR22 35 05 Transmit control register 2 

SRGR10 39 SRGRIt 49 SRGRI2 35 Gere ce le donelaion 
register 1 

SRGR20 39 SRGR21 49 SRGR22 35 a7. “pane taleigenciaior 
register 2 

MCR10 39 MCR11 49 MCR12 35 08 Multichannel register 1 

MCR20 39 MCR21 49 MCR22 35 09 Multichannel register 2 

RCERAO 39 RCERA1 49 RCERA2 35 Gaye cote cleneleuatle 
register partition A 

RCERBO 39 RCERBi 49 RCERA2 35 ge: MeceWe’cnanhel enable 
register partition B 

XCERAO 39 XCERA1 49 XCERA2 35 60 ero mel enen le 
register partition A 

XCERBO 39 XCERB1 49 XCERA2 35 bo vahemiberannetenave 
register partition B 

PCRO 39 PCR1 49 PCR2 35 OE Pin control register 
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Table 812. °5402/5410/5420 DMA Sub-addressed Registers 


Name 
DMSRCO 
DMDSTO 
DMCTRO 
DMSFCO 
DMMCRO 
DMSRC1 
DMDST1 
DMCTR1 
DMSFC1 
DMMCR1 
DMSRC2 
DMDST2 
DMCTR2 
DMSFC2 
DMMCR2 
DMSRC3 
DMDST3 
DMCTR3 
DMSFC3 
DMMCR3 
DMSRC4 
DMDST4 
DMCTR4 


taccesses to address 57h update the sub-addressed register and post-increment the sub-address contained in 


DMA 


Address 
(Hex) 


56/571 
56/57T 
56/571 
56/57T 
56/571 
56/57t 
56/571 
56/57t 
56/571 
56/57 
56/571 
56/571 
56/571 
56/57t 
56/571 
56/571 
56/571 
56/57 
56/571 
56/571 
56/571 
56/57t 
56/571 


Sub- 
address 
(Hex) 


00 
01 
02 
03 
04 
05 
06 
07 
08 
09 
OA 
0B 
0C 
0D 
OE 
OF 
10 
11 

12 
13 
14 
15 
16 


Description 

DMA channel 0 source address register 

DMA channel 0 destination address register 

DMA channel 0 element count register 

DMA channel 0 sync select and frame count register 
DMA channel 0 transfer mode control register 

DMA channel 1 source address register 

DMA channel 1 destination address register 

DMA channel 1 element count register 

DMA channel 1 sync select and frame count register 
DMA channel 1 transfer mode control register 

DMA channel 2 source address register 

DMA channel 2 destination address register 

DMA channel 2 element count register 

DMA channel 2 sync select and frame count register 
DMA channel 2 transfer mode control register 

DMA channel 3 source address register 

DMA channel 3 destination address register 

DMA channel 3 element count register 

DMA channel 3 sync select and frame count register 
DMA channel 3 transfer mode control register 

DMA channel 4 source address register 

DMA channel 4 destination address register 


DMA channel 4 element count register 


DMSBAR. Accesses to 56h update the sub-addressed register without modifying DMSBAR. 
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Table 8-12. °5402/5410/'5420 DMA Sub-addressed Registers (Continued) 


DMA 


Name 
DMSFC4 


DMMCR4 
DMSRC5 
DMDST5 
DMCTR5 
DMSFC5 
DMMCR5 
DMSRCP 
DMDSTP 
DMIDXO 
DMIDX1 
DMFRIO 
DMFRI1 
DMGSA 
DMGDA 
DMGCR 
DMGFR 


Address 


(Hex) 
56/57t 
56/57t 
56/57t 
56/57t 
56/57t 
56/57t 
56/57t 
56/57t 
56/57t 
56/57t 
56/57t 
56/57t 
56/57t 
56/57t 
56/57t 
56/57t 
56/57t 


Sub- 


address 


(Hex) 
17 
18 
19 
1A 
1B 
1G 
1D 
1E 
1F 
20 
21 
22 
23 
24 
25 
26 
27 


Description 


DMA channel 4 sync select and frame count register 
DMA channel 4 transfer mode control register 

DMA channel 5 source address register 

DMA channel 5 destination address register 

DMA channel 5 element count register 

DMA channel 5 sync select and frame count register 
DMA channel 5 transfer mode control register 

DMA source program page address (common channel) 
DMA destination program page address (common channel) 
DMA element index address register 0 

DMA element index address register 1 

DMA frame index register 0 

DMA frame index register 1 

DMA global source address reload register 

DMA global destination address reload register 

DMA global count reload register 


DMA global frame count reload register 


TtAccesses to address 57h update the sub-addressed register and post-increment the sub-address contained in 
DMSBAR. Accesses to 56h update the sub-addressed register without modifying DMSBAR. 
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8.3 General-Purpose I/O 


The ’54x offers general-purpose I/O through two dedicated pins that are soft- 
ware controlled. The two dedicated pins are the branch control input pin (BIO) 
and the external flag output pin (XF). 


8.3.1. Branch Control Input Pin (BIO) 


BIO can be used to monitor the status of peripheral devices. It is especially 
useful as an alternative to using an interrupt when time-critical loops must not 
be disturbed. A branch can be conditionally executed dependent upon the 
state of the BIO input. Of the instructions that use BIO, the execute conditional- 
ly (XC) instruction samples the condition of BIO during the decode phase of 
the pipeline; all other conditional instructions (branch, call, and return) sample 
BIO during the read phase of the pipeline. 


8.3.2 External Flag Output Pin (XF) 


XF can be used to signal external devices. The XF pin is controlled using soft- 
ware. Itis driven high by setting the XF bit (in ST1) andis driven low by clearing 
the XF bit. The set status register bit (SSBX) and reset status register bit 
(RSBX) instructions can be used to set and clear XF, respectively. XF is also 
set high at device reset. Figure 8—1 shows the relationship between the time 
the SSBX or RSBX instruction is fetched and the time the XF pin is set or reset 
(refer to the ’54x data sheet for timing specifications). The XF timing shown is 
fora sequence of single-cycle instructions. Actual timing can vary with different 
instruction sequences. 


Figure 8—1. External Flag Timing Diagram 


CLKOUT 


SSBX or RSBX instruction Delay >! jq—_ 


prefetch fetch decode access read write 


<> 
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8.4 Timer 


The on-chip timer is a software-programmable timer that consists of three 
registers and can be used to periodically generate interrupts. The timer resolu- 
tion is the CPU clock rate of the processor. The high dynamic range of the timer 
is achieved with a 16-bit counter with a 4-bit prescaler. The 5402 and the 5420 
have two on-chip timers. 


8.4.1 Timer Registers 


The on-chip timer consists of three memory-mapped registers (TIM, PRD, and 
TCR). These three registers and their respective timer addresses are listed in 
Table 8-13. 


Table 8-13. Timer Registers 


Timer 1 
Timer 0 Address 
Address (5402 only) Register Description 
0024h 0030h TIM Timer register 
0025h 0031h PRD Timer period register 
0026h 0032h TCR Timer control register 


_j Timer register (TIM). The 16-bit memory-mapped timer register (TIM) is 
loaded with the period register (PRD) value and decremented. 


_j Timer period register (PRD). The 16-bit memory-mapped timer period reg- 
ister (PRD) is used to reload the timer register (TIM). 


1 Timer control register (TCR). The 16-bit memory-mapped timer control 
register (TCR) contains the control and status bits of the timer. The TCR 
bit fields are shown in Figure 8—2 and described in Table 8-14. 
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Figure 8-2. Timer Control Register (TCR) Diagram 


15-12 11 10 9 4 3-0 


—6 5 
Free PSC tss | __TOOR 


Table 8-14. Timer Control Register (TCR) Bit Summary 


Reset 
Bit Name Value Function 


15-12 Reserved = Reserved; always read as 0. 


11 Soft 0 Used in conjunction with the Free bit to determine the state of the timer when a 
breakpoint is encountered in the HLL debugger. When the Free bit is cleared, the 
Soft bit selects the timer mode. 


Soft = 0 The timer stops immediately. 
Soft = 1 The timer stops when the counter decrements to 0. 


10 Free 0 Used in conjunction with the Soft bit to determine the state of the timer when a 
breakpoint is encountered in the HLL debugger. When the Free bit is cleared, the 
Soft bit selects the timer mode. 


Free =0 The Soft bit selects the timer mode. 
Free = 1 The timer runs free regardless of the Soft bit. 


9-6 PSC — Timer prescaler counter. Specifies the count for the on-chip timer. When PSC is 
decremented past 0 or the timer is reset, PSC is loaded with the contents of TDDR 
and the TIM is decremented. 


5 TRB — Timer reload. Resets the on-chip timer. When TRB is set, the TIM is loaded with 
the value in the PRD and the PSC is loaded with the value in TDDR. TRB is always 
read as a0. 


4 TSS 0 Timer stop status. Stops or starts the on-chip timer. At reset, TSS is cleared and 
the timer immediately starts timing. 


TSS =0 The timer is started. 
TSS = 1 The timer is stopped. 
3-0 TDDR 0000 _ =‘Timer divide-down ratio. Specifies the timer divide-down ratio (period) for the 


on-chip timer. When PSC is decremented past 0, PSC is loaded with the contents 
of TDDR. 
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8.4.2 Timer Operation 


The timer is an on-chip down-counter that can be used to periodically generate 
CPU interrupts. The timer is driven by a prescaler that is decremented by 1 at 
every CPU clock cycle. Each time the counter decrements to 0, a timer 
interrupt (TINT) is generated and the down-counter is reloaded with the period 
value. See section 6.10, Interrupts, on page 6-26, for more details about 
interrupts. 


Figure 8-3 shows a logical block diagram of the timer. It consists of two basic 
blocks: the main timer block, consisting of PRD and TIM; and a prescaler 
block, consisting of TDDR and PSC bits in TCR. The timer is clocked by the 
CPU clock. 


Figure 8-3. Timer Block Diagram 
SRESET 


CPU clock 
TSS 


Borrow Borrow 


> TINT 


> > TOUT 


Under normal operation, TIM is loaded with the contents of PRD when TIM 
decrements to 0. The contents of PRD are also loaded into TIM when the de- 
vice is reset (SRESET input in Figure 8-3) or when the timer is individually re- 
set (TRB input in Figure 8-3). TIM is clocked by the prescaler block. Each out- 
put clock from the prescaler block decrements TIM by 1. The output of the main 
timer block is the timer interrupt (TINT) signal that is sent to the CPU and to 
the timer output (TOUT) pint. The duration of the TOUT pulse is equal to the 
period of CLKOUT. 


T Note that on the ’5402, the timer1 output (TOUT 1) is only available when the HPI-8 is disabled, and the TOUT1 bit is set in the 
GPIO control register. 


On-Chip Peripherals 8-23 


Timer 


The prescaler block has two elements similar to the TIM and PRD. These are 
the prescale counter (PSC) and timer divide-down ratio (TDDR). Both PSC 
and TDDR are fields in the timer control register (TCR). Under normal opera- 
tion, PSC is loaded with the contents of TDDR when PSC decrements to 0. The 
contents of TDDR are also loaded into PSC when the device is reset or when 
the timer is individually reset. PSC is clocked by the device CPU clock. Each 
CPU clock decrements PSC by 1. PSC can be read by reading TCR, but it can- 
not be written to directly. 


The timer can be stopped by making use of the TSS input to turn off the clock 
input to the timer. Stopping the timer’s operation allows the device to run ina 
low-power mode when the timer is not needed. 


The timer interrupt (TINT) rate is equal to the CPU clock frequency divided by 
two independent factors: 


1 = 1 


TINT rate = £5 XUV ~ ty X (TDDR + 1) (PRD + 1) 


In the equation, tg(¢) is the period of CPU clock, uis the sum of the TDDR con- 
tents plus 1, and v is the sum of the PRD contents plus 1. 


The current value in the timer can be read by reading TIM; PSC can be read 
by reading TCR. Because it takes two instructions to read both registers, there 
may be a change between the two reads as the counter decrements. There- 
fore, when precise timing measurements are needed, it is more accurate to 
stop the timer before reading these two values. The timer can be stopped by 
setting the TSS bit and restarted by clearing it. 


The timer can be used to generate a sample clock for peripheral circuits such 
as an analog interface. This can be accomplished by using the TOUTT signal 
to clock a device or by using the interrupt to periodically read a register. 


The timer is initialized with the following steps: 
1) Stop the timer by writing a 1 to TSS in TCR. 


2) Load PRD. 


3) Start the timer by reloading TCR to initialize TDDR. Enable the timer by 
setting TSS to 0 and TRB to 1 to reload the timer period. 


Tt Note that on the 5402, the timer1 output (TOUT 1) is only available when the HPI-8 is disabled, and the TOUT1 bit is set in the 


GPIO control register. 
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Optionally, the timer interrupt may be enabled by (assuming INTM = 1): 
1) Clearing any pending timer interrupts by writing a 1 to TINT in the IFR. 


2) Enabling the timer interrupt by writing a 1 to TINT in the IMR. 


3) Enabling interrupts globally, if necessary, by clearing INTM to 0. 


Atreset, TIM and PRD are set to amaximum value of FFFFh. The timer divide- 
down ratio (TDDR) field of the TCR is cleared to 0 and the timer is started. 
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8.5 Clock Generator 


The clock generator allows system designers to select the clock source. The 
sources that drive the clock generator are: 


Lj Acrystal resonator with the internal oscillator circuit. The crystal resonator 
circuit is connected across the X1 and X2/CLKIN pins of the ‘54x. The 
CLKMD pins must be configured to enable the internal oscillator. 


(1 An external clock. The external clock source is directly connected to the 
X2/CLKIN pin, and X1 is left unconnected. 


The clock generator on the 54x devices consists of an internal oscillator and 
a phase-locked loop (PLL) circuit. Currently, there are two different types of 
PLL circuits on 54x devices. Some devices have hardware-configurable PLL 
circuits while others have software-programmable PLL circuits. The ’541, 
542, 543, 545, and 546 devices use a hardware-configurable PLL. The 
’541B, ’545A, '546A, ’548, 549, 5402, 5410, and 5420 devices use a soft- 
ware-programmable PLL. 


8.5.1 Hardware-Configurable PLL 
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The PLL functions with a lower external frequency source than the machine 
cycle rate of the CPU. This feature reduces high-frequency noise from a high- 
speed switching clock. The internal oscillator or the external clock source is 
fed into the PLL. The internal CPU clock is generated by multiplying the exter- 
nal clock source or the internal oscillator frequency by a factor N (PLL x N). 
If you are using the internal oscillator circuit, the clock source is divided by 2 
to generate the internal CPU clock. If you are using the external clock, the inter- 
nal CPU clock is a factor of PLL x N. 


The PLL has a maximum operating frequency of 40 MHz on a 25-ns ’54x 
device. The PLL requires a transitory locking time of 50 us. The locking time 
is necessary during reset and recovery from the IDLE3 power-down mode. 
See section 6.11, Power-Down Modes, on page 6-50, and section 10.5.2, 
IDLE3, on page 10-26, for more information. 


The clock mode is determined by the CLKMD1, CLKMD2, and CLKMD3 pins. 
Table 8—15 shows how these pins select the clock mode. For non-PLL use, the 
frequency of the CPU clock is half the crystal’s oscillating frequency or the 
external clock frequency. 


The clock mode must not be reconfigured with the clock mode pins during 
normal operation. During IDLE3 mode, the clock mode can be reconfigured 
after CLKOUT is set high. 


Clock Generator 


Table 8-15. Clock Mode Configurations 


Mode Select Pins Clock Modet 
CLKMD1 CLKMD2 CLKMD3 Option 1 Option 2 
0 0 0 PLL x 3 with external source PLL x 5 with external source 
1 1 0 PLL x 2 with external source PLL x 4 with external source 
1 0 0 PLL x 3 with oscillator enabled PLL x 5 with oscillator enabled 
0 1 0 PLL x 1.5 with external source PLL x 4.5 with external source 
0 0 1 Divide-by-2 with external source Divide-by-2 with external source 
1 1 1 Divide-by 2 with oscillator Divide-by-2 with oscillator 
enabled enabled 
1 0 1 PLL x 1 with external source PLL x 1 with external source 
0 1 1 Stop modet Stop modet 


T An individual device is either an Option 1 or Option 2 clock-mode device. 

+ The PLL is disabled. The system clock is not provided to CPU/peripherals. The function of the stop mode is equivalent to that 
of the power-down mode of IDLE3; however, the IDLE 3 instruction is recommended rather than stop mode to realize full power 
saving, since IDLE3 stops clocks synchronously and can be exited with an interrupt. 


8.5.2 Software-Programmable PLL 
(541B/’545A/’546A/’548/’549/’5402/'5410/54208) 


The software-programmable PLL features a high level of flexibility, and 
includes: a clock scaler that provides various clock multiplier ratios, capability 
to directly enable and disable the PLL, and a PLL lock timer that can be used 
to delay switching to PLL clocking mode of the device until lock is achieved. 


Devices that have a built-in software-programmable PLL can be configured in 
one of two clock modes: 


Ly 


L) 


PLL mode. The input clock (CLKIN) is multiplied by 1 of 31 possible ratios 
from 0.25 to 15. These ratios are achieved using the PLL circuitry. 


DIV (divider) mode. The input clock (CLKIN) is divided by 2 or 4. When DIV 
mode is used, all of the analog parts, including the PLL circuitry, are 
disabled in order to minimize power dissipation. 


Immediately following reset, the clock mode is determined by the values of the 
three external pins, CLKMD1, CLKMD2, and CLKMD3. The modes corre- 
sponding to the CLKMD pins are shown in Table 8—16 and Table 8-17. 

§ The VC5420 device does not have CLKMD pins. Following reset, the '5420 operates in bypass mode (PLL is off). 


On-Chip Peripherals 8-27 


Clock Generator 


Table 8-16. Clock Mode Settings at Reset 
(541B/'545A/'546A/'548/'549/'54 10) 


CLKMD 
CLKMD1 CLKMD2 CLKMD3 Reset Value Clock Mode 

0 0 0 0000h Divide-by-2 with external source 

0 0 1 1000h Divide-by-2 with external source 

0 1 0 2000h Divide-by-2 with external source 

1 0 0 4000h Divide-by-2, internal oscillator 
enabled 

1 1 0 6000h Divide-by-2 with external source 

1 1 1 7000h Divide-by-2, internal oscillator 
enabledt 

1 0 1 0007h PLL x 1 with external source 

0 1 1 — Stop mode 

t Reserved on ’549 and '5410 
Table 8-17. Clock Mode Settings at Reset ('5402) 
CLKMD 
CLKMD1 CLKMD2 CLKMD3 Reset Value Clock Mode 

0 0 0 E007h PLL x 15, internal oscillator enabled 

0 0 1 9007h PLL x 10, internal oscillator enabled 

0 1 0 4007h PLL x 5, internal oscillator enabled 

1 0 0 1007h PLL x 2, internal oscillator enabled 

1 1 0 FOO7h PLL x 1, internal oscillator enabled 

1 1 1 0000h 1/2 (PLL disabled), internal oscillator 
enabled 

1 0 1 FOOOh 1/4 (PLL disabled), internal oscillator 
enabled 

0 1 1 _— Reserved (bypass mode) 


Following reset, the software-programmable PLL can be programmed to any 
configuration desired. The following clock mode pin combinations enable the 
PLL during reset: CLKMD (3-1) = 000 — 110 on ’5402, and CLKMD (3-1) = 
101 on all other devices. When these clock mode pin combinations are used, 


Clock Generator 


the internal PLL lock timer is not active; therefore, the system must delay 
releasing reset in order to allow for the PLL lock-time delay. 


The programming of the PLL is loaded in the 16-bit memory-mapped (address 
58h) clock mode register (CLKMD). The CLKMD is used to define the clock 
configuration of the PLL clock module. The CLKMD bit fields are shown in 
Figure 8—4 and described in Table 8-18. Note that upon reset, the CLKMD is 
initialized with a predetermined value dependent only upon the state of the 
CLKMD(1-3) pins (see Table 8-16). 


Figure 8-4. Clock Mode Register (CLKMD) Diagram 


15-12 11 10-3 2 1 0 
PLLMUL PLLDIV PLLCOUNT PLLON/OFF | PLLNDIV | PLLSTATUS 
R/wt R/wWt R/wt R/wt R/W R 


t When in DIV mode (PLLSTATUS is low), PLLMUL, PLLDIV, PLLCOUNT, and PLLON/OFF are don’t cares, and their contents 
are indeterminate. 
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Table 8-18. Clock Mode Register (CLKMD) Bit Summary 


Bit Name 
15-12 PLLMUL 


11 PLLDIV 


10-3. PLLCOUNT 


2 PLLON/OFF 


i PLLNDIV 


0 PLLSTATUS 
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Function 


PLL multiplier. Defines the frequency multiplier in conjunction with PLLDIV and 
PLLNDIV, as shown in Table 8-19 on page 8-31. 


PLL divider. Defines the frequency multiplier in conjunction with PLLMUL and 
PLLNDIV, as shown in Table 8-19 on page 8-31. 


PLL counter value. Specifies the number of input clock cycles (in increments of 
16 cycles) forthe PLL lock timer to count before the PLL begins clocking the processor 
after the PLL is started. The PLL counter is a down-counter, which is driven by the 
input clock divided by 16; therefore, for every 16 input clocks, the PLL counter 
decrements by 1. See section Using the PLLCOUNT Programmable Lock Timer, on 
page 8-31 for more information about PLLCOUNT. 


The PLL counter can be used to ensure that the processor is not clocked until the PLL 
is locked, so that only valid clock signals are sent to the device. 


PLL on/off. Enables or disables the PLL part of the clock generator in conjunction with 
PLLNDIV. PLLON/OFF and PLLNDIV both force the PLL to operate; when PLLON/ 
OFF is high, the PLL runs independently of the state of PLLNDIV: 


PLLON/OFF PLLNDIV PLL State 
0 0 off 
0 1 on 
1 0 on 
1 1 on 


PLL clock generator select. Determines whether the clock generator works in PLL 
mode or in divider (DIV) mode, thus defining the frequency multiplier in conjunction 
with PLLMUL and PLLDIV. 

PLLNDIV = 0 Divider (DIV) mode is used. 
PLLNDIV = 1 PLL mode is used. 

PLL status. Indicates the mode that the clock generator is operating. 
PLLSTATUS = 0 Divider (DIV) mode 


PLLSTATUS = 1 PLL mode 
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Table 8-19. PLL Multiplier Ratio as a Function of PLLNDIV, PLLDIV, and PLLMUL 


PLLNDIV PLLDIV PLLMUL Multipliert 
0 X 0-14 0.5 
0 x 15 0.25 
1 0 0-14 PLLMUL + 1 
1 0 15 1 (bypass)t 
1 1 0 or even (PLLMUL + 1) + 2 
1 1 odd PLLMUL + 4 


t CLKOUT = CLKIN x Multiplier 
+ This is the default mode for the ’5420 after reset. 


Programming Considerations When Using the Software-Programmable PLL 


The software-programmable PLL offers many different options in startup 
configurations, operating modes, and power-saving features. Programming 
considerations and several software examples are presented here to illustrate 
the proper use of the software-programmable PLL at start-up, when switching 
between different clocking modes, and before and after IDLE 1/IDLE 2/IDLE 3 
instruction execution. 


Using the PLLCOUNT Programmable Lock Timer 


During the lockup period, the PLL should not be used to clock the ’54x. The 
PLLCOUNT programmable lock timer provides a convenient method of auto- 
matically delaying clocking of the device by the PLL until lock is achieved. 


The PLL lock timer is a counter, loaded from the PLLCOUNT field in the 
CLKMD register, that decrements from its preset value to 0. The timer can be 
preset to any value from 0 to 255, and its input clock is CLKIN divided by 16. 
The resulting lockup delay can therefore be set from 0 to 255 x 16 CLKIN 
cycles. 


The lock timer is activated when the clock generator operating mode is 
switched from DIV to PLL (see section Switching From DIV Mode to PLL Mode, 
on page 8-32). During the lockup period, the clock generator continues to op- 
erate in DIV mode; after the PLL lock timer has decremented to 0, the PLL be- 
gins clocking the ’54x. 
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The decimal preset value, PLLCOUNT, is: 


LockupTime 


PLLCOUNT > 
16 x Toxkin 


where Tc kin is the input reference clock period and LockupTime is the 
required PLL lockup time as shown in Figure 8-5. 


Figure 8-5. PLL Lockup Time Versus CLKOUT Frequency 
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Switching Clock Mode From DIV Mode to PLL Mode 
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Several circumstances may require switching from DIV mode to PLL mode; 
however, note that if the PLL is not locked when switching from DIV mode to 
PLL mode, the PLL lockup time delay must be observed before the mode 
switch occurs to ensure that only proper clock signals are sent to the device. 
It is, therefore, important to know whether or not the PLL is locked when 
switching operating modes. 


The PLL is unlocked on power-up, after changing the PLLMUL or PLLDIV 
values, after turning off the PLL (PLLON/OFF = 0), or after loss of input refer- 
ence clock. Once locked, the PLL remains locked even in DIV mode as long 
as the PLL had been previously locked and has not been turned off (PLLON/ 
OFF stays 1), and the PLLMUL and PLLDIV values have not been changed 
since the PLL was locked. 


Switching from DIV mode to PLL mode (setting PLLNDIV to 1) activates the 
PLLCOUNT programmable lock timer (when PLLCOUNT is preloaded with a 


Clock Generator 


nonzero value) and this can be used to provide a convenient method for imple- 
menting the lockup time delay. The PLLCOUNT lock timer feature should be 
used in the previously described situations where the PLL is unlocked unless 
a reset delay is used to implement the lockup delay, or the PLL is not used. 


Switching from DIV mode to PLL mode is accomplished by loading CLKMD. 
The following procedure describes switching from DIV mode to PLL mode 
when the PLL is not locked. When performing this mode switch with the PLL 
already locked, the effect is the same as when switching from PLL mode to DIV 
mode, but in the reverse order. In this case, the delays of when the new clock 
mode takes effect are the same. 


When switching from DIV mode to PLL mode with the PLL unlocked, or when 
the mode change will result in unlocked operation, the PLLMUL, PLLDIV, and 
PLLNDIV bits are set to select the desired frequency multiplier as shown in 
Table 8-19 on page 8-31, and the PLLCOUNT bits are set to select the 
required lockup time delay. Note that PLLMUL, PLLDIV, PLLCOUNT, and 
PLLON/OFF can only be modified when in DIV mode. 


Once the PLLNDIV bit is set, the PLLCOUNT timer begins being decremented 
from its preset value. When the PLLCOUNT timer reaches 0, the switch to PLL 
mode takes effect after 6 CLKIN cycles plus 3.5 PLL cycles. When the switch 
to PLL mode is completed, the PLLSTATUS bit in CLKMD is read as 1. Note 
that during the PLL lockup period, the 54x continues operating in DIV mode. 


The following code can be used to switch from DIV mode to PLL x 3 mode, 
with a CLKIN frequency of 138 MHz and PLLCOUNT = 41 (decimal): 


STM #0010000101001111b, CLKMD 


Switching Clock Mode From PLL Mode to DIV Mode 


When switching from PLL mode to DIV mode, the PLLCOUNT delay does not 
occur and the switch between the two modes takes place after a short transi- 
tion delay. 


The switch from PLL mode to DIV mode is also accomplished by loading 
CLKMD. The PLLNDIV bit is cleared to 0, selecting DIV mode, and the 
PLLMUL bits are set to select the desired frequency multiplier as shown in 
Table 8-19 on page 8-31. 


The switch to DIV mode takes effect in 6 CLKIN cycles plus 3.5 PLL cycles for 
all PLLMUL values except 1111b. For a PLLMUL value of 1111b, the switch to 
DIV mode takes effect in 12 CLKIN cycles plus 3.5 PLL cycles. When the 
switch to DIV mode is completed, the PLLSTATUS bit in CLKMD is read as 0. 


Example 8—1 shows acode sequence that can be used to switch from PLL x 3 
mode to divide-by-2 mode. Note that the PLLSTATUS bit is polled to determine 
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when the switch to DIV mode has taken effect, and then the STM instruction 
is used to turn off the PLL at this point. 


Example 8-1. Switching Clock Mode From PLL x 3 Mode to Divide-by-2 Mode 


STM #0b, CLKMD ;switch to DIV mode 
TstStatu: LDM CLKMD, A 


AND #01b, A ;poll STATUS bit 
BC TstStatu, ANEQ 
STM #0b, CLKMD j;reset PLLON/OFF when STATUS 


;is DIV mode 


Changing the PLL Multiplier Ratio 
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When switching from one PLL multiplier ratio to another multiplier ratio is 
required, the clock generator must first be switched from PLL mode to DIV 
mode before selecting the new multiplier ratio; switching directly from one PLL 
multiplier ratio to another multiplier ratio is not supported. 


In order to switch from one PLL multiplier ratio to another multiplier ratio, the 
following steps must be followed: 


1) Clear the PLLNDIV bit to 0, selecting DIV mode. 


2) Poll the PLLSTATUS bit until a 0 is obtained, indicating that DIV mode is 
enabled. 


3) Modify CLKMD to set the PLLMUL, PLLDIV, and PLLNDIV bits to the 
desired frequency multiplier as shown in Table 8—19 on page 8-31 


4) Set the PLLCOUNT bits to the required lock-up time. 


Once the PLLNDIV bit is set, the PLLCOUNT timer begins being decremented 
from its preset value. When the PLLCOUNT timer reaches 0, the new PLL 
mode takes effect after 6 CLKIN cycles plus 3.5 PLL cycles. 


Note that a direct switch between divide-by-2 mode and divide-by-4 mode is 
not possible. To switch between these two modes, the clock generator must 
first be set to PLL mode with an integer-only (nonfractional) multiplier ratio and 
then set back to DIV mode in the desired divider configuration (see section 
Switching From DIV Mode to PLL Mode, on page 8-32). 


Example 8-2 shows a code sequence that can be used to switch the clock 
mode from PLL x X mode to PLL x 1 mode. 
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Example 8-2. Switching Clock Mode From PLL x X Mode to PLL x 1 Mode 


STM #0b, CLKMD ;switch to DIV mode 
TstStatu: LDM CLKMD, A 

AND #0lb, A ;poll STATUS bit 

BC TstStatu, ANEQ 


STM #0000001111101111b, CLKMD ;switch to PLL X 1 mode 


PLL Operation Immediately Following Reset 


Immediately following reset, the clock mode is determined by the values of the 
three external pins, CLKMD1, CLKMD2, and CLKMD3. Switching from the ini- 
tial clock mode to any other mode can easily be accomplished by changing the 
contents of CLKMD. If use of the internal oscillator with an external crystal is 
desired, the device CLKMD pins must be configured at reset to enable the in- 
ternal oscillator. See Table 8—16 and Table 8-17 on page 8-28 for external pin 
values and available modes on each device. (The internal oscillator option is 
not available on the 5420.) 


The following code can be used to switch from divide-by-2 mode to 
PLL x 3 mode: 


STM #0010000101001111b, CLKMD 


PLL Considerations When Using IDLE Instruction 


When using one of the IDLE instructions to reduce power requirements, 
proper management of the PLL is important. The clock generator consumes 
the least power when operating in DIV mode with the PLL disabled. Therefore, 
if power dissipation is a significant consideration, it is desirable to switch from 
PLL mode to DIV mode and disable the PLL, before executing an IDLE 1, 
IDLE 2, or IDLE 3 instruction. This is accomplished as explained in section 
Switching From PLL Mode to DIV Mode, on page 8-33. After waking up from 
IDLE1/IDLE2/IDLE9, the clock generator can be reprogrammed to PLL mode 
as explained in section Switching From DIV Mode to PLL Mode, on page 8-32. 


Note that when the PLL is stopped during an IDLE state and the ’54x device 
is restarted and the clock generator is switched back to PLL mode, the PLL 
lockup delay occurs in the same manner as in a normal device startup. There- 
fore, in this case, the lockup delay must also be accounted for, either externally 
or by using the PLL lockup counter timer. 


Example 8-3 shows a code sequence that switches the clock generator from 
PLL x 3 mode to divide-by-2 mode, turns off the PLL, and enters IDLE3. After 
waking up from IDLE3, the clock generator is switched from DIV mode to 
PLL x 3 mode using a single STM instruction, with a PLLCOUNT of 64 (deci- 
mal) used for the lock timer value. 
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Example 8-3. Switching Clock From PLL x 3 Mode to Divide-by-2 Mode, 
Turning Off the PLL, and Entering IDLE3 


STM #0b, CLKMD ; switch to DIV mode 
TstStatu: LDM CLKMD, A 
AND #01lb, A ;poll STATUS bit 
BC TstStatu, ANEQ 
STM #0b, CLKMD ;reset PLLON_OFF when STATUS 
jis DIV mode 
IDLE3 
(After IDLE3 wake-up - switch the PLL from DIV mode to PLL X 3 mode) 
STM #0010001000000111b, CLKMD ;PLLCOUNT = 64 (decimal) 


PLL Considerations When Using the Bootloader 
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The ROM on the ’545A and ’546A contains a bootloader program that can be 
used to load programs into RAM for execution following reset. When using this 
bootloader with the software-programmable PLL, several considerations are 
important for proper system operation. 


On the 545A and ’546A, for compatibility, the bootloader configures the PLL 
to the same mode as would have resulted if the same CLKMD(1-3) input bits 
had been provided to the option-1 or option-2 hardware-programmable PLL 
(see Table 8-15 on page 8-27), according to whether the 545A or 546A is an 
option-1 or option-2 device. Once the bootloader program has finished 
executing and control is transferred to the user’s program, the PLL can be 
reprogrammed to any desired configuration. 


Host Port Interface 


8.6 Host Port Interface 


The standard host port interface (HPI) is available on the 542, 545, 548, and 
549 devices. The HPI is an 8-bit parallel port that interfaces a host device or 
host processor to the 54x. Information is exchanged between the 54x and the 
host device through on-chip 54x memory that is accessible by both the host 
and the ’54x. 


Enhanced host port interfaces are available on the 5402, 5410 (HPI-8) and 
5420 (HPI-16) devices. This chapter does not describe these enhanced HPIs. 
For more information on the HPI-8 and HPI-16, see volume 5 of this reference 
set: TMS320C54x DSP, Enhanced Peripherals, literature number SPRU302. 


The HPI interfaces to the host device as a peripheral, with the host device as 
master of the interface, facilitating ease of access by the host. The host device 
communicates with the HPI through dedicated address and data registers, to 
which the ’54x does not have direct access, and the HPI control register, using 
the external data and interface control signals (see Figure 8-6). Both the host 
device and the ’54x have access to the HPI control register. 


Figure 8-6. Host Port Interface Block Diagram 
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The HPI provides 16-bit data to the 54x while maintaining the economical 8-bit 
external interface by automatically combining successive bytes transferred 
into 16-bit words. When the host device performs a data transfer with the HPI 
registers, the HPI control logic automatically performs an access to a dedi- 
cated 2K-word block of internal 54x dual-access RAM to complete the trans- 
action. The 54x can then access the data within its memory space. The HPI 
RAM can also be used as general-purpose dual-access data or program RAM. 


The HPI has two modes of operation, shared-access mode (SAM) and host- 
only mode (HOM). In shared-access mode (the normal mode of operation), 
both the ’54x and the host can access HPI memory. In this mode, asynchro- 
nous host accesses are resynchronized internally and, in the case of a conflict 
between a ’54x and a host cycle (where both accesses are reads or writes), 
the host has access priority and the 54x waits one cycle. In host-only mode, 
only the host can access HPI memory while the ’54x is in reset or in IDLE2 with 
all internal and external clocks stopped. The host can therefore access the HPI 
RAM while the ’54x is in its minimum power consumption configuration. 


The HPI supports high speed, back-to-back host accesses. In shared-access 
mode, the HPI can transfer one byte every five CLKOUT cycles (that is, 
64M bps) with the 54x running at a40-MHz CLKOUT. The HPI is designed so 
the host can take advantage of this high bandwidth and run at frequencies up 
to (Fd*n)/5, where Fd is the 54x CLKOUT frequency and n is the number of 
host cycles for an external access. Therefore, with a 40-MHz 54x and com- 
mon values of 4 (or 3) for n, the host can run at speeds of up to 32 (or 24) MHz 
without requiring wait states. In the host-only mode, the HP! supports even 
higher speed back-to-back host accesses on the order of one byte every 50 ns 
(that is, 160M bps), independent of the ‘54x clock rate (refer to the 
TMS320C54x data sheet for specific detailed timing information). 


8.6.1. Basic Host Port Interface Functional Description 
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The external HPI interface consists of the 8-bit HP! data bus and control sig- 
nals that configure and control the interface. The interface can connect to a 
variety of host devices with little or no additional logic necessary. Figure 8—7 
shows a simplified diagram of a connection between the HPI and a host 
device. 


Host Port Interface 


Figure 8—7. Generic System Block Diagram 
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The 8-bit data bus (HDO—HD7) exchanges information with the host. Because 
of the 16-bit word structure of the ’54x, all transfers with a host must consist 
of two consecutive bytes. The dedicated HBIL pin indicates whether the first 
or second byte is being transferred. An internal control register bit determines 
whether the first or second byte is placed into the most significant byte of a 
16-bit word. The host must not break the first byte/second byte (HBIL low/high) 
sequence of an ongoing HPI access. If this sequence is broken, data can be 
lost, and unpredictable operation can result. 


The two control inputs (HCNTLO and HCNTL1) indicate which internal HPI 
register is being accessed and the type of access to the register. These inputs, 
along with HBIL, are commonly driven by host address bus bits or a function 
of these bits. Using the HCNTLO/1 inputs, the host can specify an access to 
the HPI control (HPIC) register, the HPI address (HPIA) register (which serves 
as the pointer into HPI memory), or HPI data (HPID) register. The HPID regis- 
ter can also be accessed with an optional automatic address increment. 


The autoincrement feature provides a convenient way of reading or writing to 
subsequent word locations. In autoincrement mode, a data read causes a 
postincrement of the HPIA, and a data write causes a preincrement of the 
HPIA. By writing to the HPIC, the host can interrupt the ‘54x CPU, and the HINT 
output can be used by the ’54x to interrupt the host. The host can also acknowl- 
edge and clear HINT by writing to the HPIC. 
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Table 8-20 summarizes the three registers that the HPI utilizes for commu- 
nication between the host device and the 54x CPU and their functions. 


Table 8-20. HPI Registers Description 


Name 


HPIA 


Address 


Description 


HPI address register. Directly accessible only by the host. Contains the address in 


HPIC 


HPID 


002Ch 


the HPI memory at which the current access occurs. 


HPI control register. Directly accessible by either the host or by the 54x. Contains 
control and status bits for HPI operations. 


HPI data register. Directly accessible only by the host. Contains the data that was 
read from the HP! memory if the current access is a read, or the data that will be 


written to HPI memory if the current access is a write. 


The two data strobes (HDS1 and HDS2), the read/write strobe (HR/W), and 
the address strobe (HAS) enable the HPI to interface to a variety of industry- 
standard host devices with little or no additional logic required. The HPI is easi- 
ly interfaced to hosts with multiplexed address/data bus, separate address 
and data buses, one data strobe and a read/write strobe, or two separate 
strobes for read and write. This is described in detail later in this section. 


The HPI ready pin (HRDY) allows insertion of wait states for hosts that support 
a ready input to allow deferred completion of access cycles and have faster 
cycle times than the HPI can accept due to 54x operating clock rates. If HRDY, 
when used directly from the ’54x, does not meet host timing requirements, the 
signal can be resynchronized using external logic if necessary. HRDY is useful 
when the ’54x operating frequency is variable, or when the host is capable of 
accessing ata faster rate than the maximum shared-access mode access rate 
(up to the host-only mode maximum access rate). In both cases, the HRDY 
pin provides a convenient way to automatically (no software handshake need- 
ed) adjust the host access rate to a faster ’54x clock rate or switch the HPI 
mode. 


All of these features combined allow the HPI to provide a flexible and efficient 
interface to a wide variety of industry-standard host devices. Also, the simplic- 
ity of the HPI interface greatly simplifies data transfers both from the host and 
the ‘54x sides of the interface. Once the interface is configured, data transfers 
are made with a minimum of overhead at a maximum speed. 
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8.6.2 Details of Host Port Interface Operation 


Table 8-21. 


HPI Pin 
HAS 


HBIL 


HDO-HD7 


T= Input, O = 


This section includes a detailed description of each HPI external interface pin 
function, as well as descriptions of the register and control bit functions. Log- 
ical interface timings and initialization and read/write sequences are dis- 
cussed in section 8.6.3, Host Read/Write Access to HPI, on page 8-46. 


The external HPI interface signals implement a flexible interface to a variety 
of types of host devices. Devices with single or multiple data strobes and with 
or without address latch enable (ALE) signals can easily be connected to the 


HPI. 


Table 8—21 gives a detailed description of the function of each of the HPI exter- 
nal interface pins. 


HPI Signal Names and Functions 
Host Pin Statet Signal Function 
Address latch | Address strobe input. Hosts with a multiplexed address and data bus 


enable (ALE) or 
Address strobe or 
unused (tied high) 


Address or control 
lines 


Address or control 


lines 


Address or control 
lines 


Data bus VO/Z 


Output, Z = High impedance 


connect HAS to their ALE pin or equivalent. HBIL, HCNTLO/1, and 
HR/W are then latched on HAS falling edge. When used, HAS must 
precede the later of HCS, HDS1, or HDS2 (see ’54x data sheet for 
detailed HPI timing specifications). Hosts with separate address and 
data bus can connect HAS to a logic-1 level. In this case, HBIL, 
HCNTLO/1, and HR/W are latched by the later of HDS1, HDS2, or 
HCS falling edge while HAS stays inactive (high). 


Byte identification input. Identifies first or second byte of transfer (but 
not most significant or least significant — this is specified by the BOB 
bit in the HPIC register, described later in this section). HBIL is low 
for the first byte and high for the second byte. 


Host control inputs. Selects a host access to the HPIA register, the 
HPI data latches (with optional address increment), or the HPIC 
register. 


Chip select. Serves as the enable input for the HPI and must be low 
during an access but may stay low between accesses. HCS normally 
precedes HDS1 and HDS2, but this signal also samples HCNTLO/1, 
HR/W, and HBIL if HAS is not used and HDS1 or HDS2 are already 
low (this is explained in further detail later in this section). Figure 8-8 
on page 8-43 shows the equivalent circuit of the HCS, HDS1 and 
HDS2 inputs. 


eae bidirectional 3-state data bus. HD7 (MSB) through HDO 
(LSB) are placed in the high-impedance state when not outputting 
(HDSx | HCS = 1) or when EMU1/OFF is active (low). 
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Table 8-21. 


HPI Pin 


HINT 


HRDY 


HR/W 


Host Pin StateT 


Read strobe and | 
write strobe or 
data strobe 


Host interrupt O/Z 
input 


Asynchronous O/Z 
ready 


Read/Write strobe, 
address line, or 
multiplexed 
address/data 


tI = Input, O = Output, Z = High impedance 
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HPI Signal Names and Functions (Continued) 


Signal Function 


Data strobe inputs. Control transfer of data during host access 
cycles. Also, when_HAS is_not used, used to sample HBIL, 
HCNTLO/1, and HR/W when HCS is already low (which is the case 
in normal operation). Hosts with separate read and write strobes 
connect those strobes to either HDS1 or HDS2. Hosts with a single 
data strobe connect it to either HDS1 or HDS2, connecting the 
unused pin high. Regardless of HDS connections, HR/W is still 
required to determine direction of transfer. Because HDS1 and HDS2 
are internally exclusive-NORed, hosts with a high true data strobe 
can connect this to one of the HDS inputs with the other HDS input 
connected low. Figure 8-8 on page 8-43 shows the equivalent circuit 
of the HDS1, HDS2, and HCS inputs. 


Host interrupt output. Controlled by the HINT bit in the HPIC. Driven 
high when the ’54x is being reset. Placed in the high impedance state 
when EMU1/OFF is active (low). 


HPI ready output. When high, indicates that the HPI is ready for a 
transfer to be performed. When low, indicates that the HPI is busy 
completing the internal portion of the previous transaction. Placed in 
high impedance when EMU1/OFF is active (low). HCS enables 
HRDY; that is, HRDY is always high when HGS is high. 


Read/write input. Hosts must drive HR/W high to read HPI! and low 
to write HPI. Hosts without a read/write strobe can use an address 
line for this function. 


The HGS input serves primarily as the enable input for the HPI, and the HDS1 
and HDS2 signals control the HPI data transfer; however, the logic with which 
these inputs are implemented allows their functions to be interchanged if de- 
sired. If HCS is used in place of HDS1 and HDS2 to control HPI access cycles, 
HRDY operation is affected (since HCS enables HRDY and HRDY is always 
high when HCS is high). The equivalent circuit for these inputs is shown in 
Figure 8—8. The figure shows that the internal strobe signal that samples the 
HCNTLO/1, HBIL, and HR/W inputs (when HAS is not used) is derived from all 
three of the input signals, as the logic illustrates. Therefore, the latest of HDS1, 
HDS2, or HCS is the one which actually controls sampling of the HCNTLO/1, 
HBIL, and HR/W inputs. Because HDS1 and HDS2 are exclusive-NORed, 
both these inputs being low does not constitute an enabled condition. 
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Figure 8-8. Select Input Logic 
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When using the HAS input to sample HCNTLO/1, HBIL, and HR/W,, this allows 
these signals to be removed earlier in an access cycle, therefore allowing more 
time to switch bus states from address to data information, facilitating interface 
to multiplexed address and data type buses. In this type of system, an ALE sig- 
nal is often provided and would normally be the signal connected to HAS. 


The two control pins (HCNTLO and HCNTL1) indicate which internal HPI regis- 
ter is being accessed and the type of access to the register. The states of these 
two pins select access to the HPI address (HPIA), HPI data (HPID), or HPI 
control (HPIC) registers. The HPIA register serves as the pointer into HPI 
memory, the HPIC contains control and status bits for the transfers, and the 
HPID contains the actual data transferred. Additionally, the HPID register can 
be accessed with an optional automatic address increment. Table 8-22 
describes the HCNTLO/1 bit functions. 


Table 8-22. HPI Input Control Signals Function Selection Descriptions 


HCNTL1 
0 


0 


HCNTLO Description 


0 
1 


Host can read or write the HPI control register, HPIC. 


Host can read or write the HPI data latches. HPIA is automatically postincremented each 
time a read is performed and preincremented each time a write is performed. 


Host can read or write the address register, HPIA. This register points to the HPI 
memory. 


Host can read or write the HPI data latches. HPIA is not affected. 


On the 54x, HPI memory is a2K x 16-bit word block of dual-access RAM that 
resides at 1000h to 17FFh in data memory space and optionally, depending 
on the state of the OVLY bit, in program memory space. 


From the host interface, the 2K-word block of HPI memory can conveniently 
be accessed at addresses 0 through 7FFh; however, the memory can also be 
accessed by the host starting with any HPIA values with the 11 LSBs equal to 
0. For example, the first word of the HPI memory block, addressed at 1000h 
by the ’54x in data memory space, can be accessed by the host with any of 
the following HPIA values: 0000h, 0800h,1000h,1800h, --- F800h. 
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The HPI autoincrement feature provides a convenient way of accessing con- 
secutive word locations in HPI memory. In the autoincrement mode, a data 
read causes a postincrement of the HPIA, and a data write causes a preincre- 
ment of the HPIA. Therefore, if a write is to be made to the first word of HPI 
memory with the increment option, due to the preincrement nature of the write 
operation, the HPIA should first be loaded with any of the following values: 
O7FFh, OFFFh, 17FFh, --- FFFFh. The HPIA is a 16-bit register and all 16 bits 
can be written to or read from, although with a 2K-word HPI memory imple- 
mentation, only the 11 LSBs of the HPIA are required to address the HPI 
memory. The HPIA increment and decrement affect all 16 bits of this register. 


HPI Control Register Bits and Function 


Four bits control HPI operation. These bits are BOB (which selects first or se- 
cond byte as most significant), SMOD (which selects host or shared-access 
mode), and DSPINT and HINT (which can be used to generate ’54x and host 
interrupts, respectively) and are located in the HPI control register (HPIC). A 
detailed description of the HPIC bit functions is presented in Table 8-23. 


Table 8-23. HPI Control Register (HPIC) Bit Descriptions 


Bit 
BOB 


SMOD 
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Host Access 
Red/Write 


Read 


*b4x Access Description 


- If BOB = 1, first byte is least significant. If BOB = 0, first byte is most 
significant. BOB affects both data and address transfers. Only the 
host can modify this bit and it is not visible to the 54x. BOB must be 
initialized before the first data or address register access. 


Read/Write If SMOD = 1, shared-access mode (SAM) is enabled: the HPI 
memory can be accessed by the ’54x. If SMOD = 0, host-only mode 
(HOM) is enabled: the 54x is denied access to the entire HPI RAM 
block. SMOD = 0 during reset; SMOD = 1 after reset. SMOD can be 
modified only by the ’54x but can be read by both the ’54x and the 
host. 
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Table 8-23. HPI Control Register (HPIC) Bit Descriptions (Continued) 


Bit Host Access ’°54x Access _ Description 
DSPINT Write - The host processor-to-’54x interrupt. This bit can be written only by 


the host and is not readable by the host or the 54x. When the host 
writes a 1 to this bit, an interrupt is generated to the 54x. Writing a 
0 to this bit has no effect. Always read as 0. When the host writes 
to HPIC, both bytes must write the same value. See this section for 
a detailed description of DSPINT function. 


HINT Read/Write Read/Write — This bit determines the state of the 54x HINT output, which can be 
used to generate an interrupt to the host. HINT = 0 upon reset, which 
causes the external HINT output to be inactive (high). The HINT bit 
can be set only by the ’54x and can be cleared only by the host. The 
‘54x writes a 1 to HINT, causing the HINT pin to go low. The HINT 
bit is read by the host or the ’54x as a 0 when the external HINT pin 
is inactive (high) and as a 1 when the HINT pin is active (low). For 
the host to clear the interrupt, however, it must write a 1 to HINT. 
Writing a 0 to the HINT bit by either the host or the 54x has no effect. 
See this section for a detailed description of HINT function. 


Because the host interface always performs transfers with 8-bit bytes and the 
control register is normally the first register accessed to set configuration bits 
and initialize the interface, the HPIC is organized on the host side as a 16-bit 
register with the same high and low byte contents (although access to certain 
bits is limited, as described previously) and with the upper bits unused on the 
’B4x side. The control/status bits are located in the least significant four bits. 
The host accesses the HPIC register with the appropriate selection of 
HCNTLO/1, as described previously, and two consecutive byte accesses to the 
8-bit HP! data bus. When the host writes to HPIC, both the first and second 
byte written must be the same value. The ’54x accesses the HPIC at 002Ch 
in data memory space. 


The layout of the HPIC bits is shown in Figure 8-9 through Figure 8-12. In the 
figures for read operations, if 0 is specified, this value is always read; if X is 
specified, an unknown value is read. For write operations, if X is specified, any 
value can be written. On a host write, both bytes must be identical. Note that 
bits 4-7 and 12-15 on the host side and bits 4-15 on the ’54x side are reserved 
for future expansion. 


Figure 8-9. HPIC Diagram — Host Reads from HPIC 


15-12 11 10 7 1 0 


9 8 3 2 
x [nwt] o [swoo feos] x [wr] o [soo [208 | 


Note: X= Unknown value is read. 
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Figure 8—10. HPIC Diagram — Host Writes to HPIC 
15-12 11 10 9 8 7-4 3 2 1 0 

Note: X = Any value can be written. 
Figure 8—11.HPIC Diagram — TMS320C54x Reads From HPIC 

15-4 a 2 1 0 
xine | o [ston | 0 
Note: X = Unknown value is read. 
Figure 8—12. HPIC Diagram — TMS320C54x Writes to HPIC 

15-4 


3 2 1 0 


Note: X = Any value can be written. 


Because the 54x can write to the SMOD and HINT bits, and these bits are read 
twice on the host interface side, the first and second byte reads by the host may 
yield different data if the 54x changes the state of one or both of these bits in 
between the two read operations. The characteristics of host and ‘54x HPIC 
read/write cycles are summarized in Table 8-24. 


Table 8-24. HPIC Host/TMS320C54x Read/Write Characteristics 


Device Read Write 
Host 2 bytes 2 bytes (Both bytes must be equal) 
’BAx 16 bits 16 bits 


8.6.3 Host Read/Write Access to HPI 


The host begins HPI accesses by performing the external interface portion of 
the cycle; that is, initializing first the HPIC register, then the HPIA register, and 
then writing data to or reading data from the HPID register. Writing to HPIA or 
HPID initiates an internal cycle that transfers the desired data between the 
HPID and the dedicated internal HPI memory. Because this process requires 
several 54x cycles, each time an HPI access is made, data written to the HPID 
is not written to the HPI memory until after the host access cycle, and the data 
read from the HPID is the data from the previous cycle. Therefore, when read- 
ing, the data obtained is the data from the location specified in the previous 
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access, and the current access serves as the initiation of the next cycle. A simi- 
lar sequence occurs for awrite operation: the data written to HPID is not written 
to HPI memory until after the external cycle is completed. If an HPID read op- 
eration immediately follows an HPID write operation, the same data (the data 
written) is read. 


The autoincrement feature available for HPIA results in sequential accesses 
to HPI memory by the host being extremely efficient. During random 
(nonsequential) transfers or sequential accesses with a significant amount of 
time between them, itis possible that the 54x may have changed the contents 
of the location being accessed between a host read and the previous host data 
read/write or HPIA write access, because of the prefetch nature of internal HPI 
operation. If this occurs, data different from the current memory contents may 
be read. Therefore, in cases where this is of concern in a system, two reads 
from the same address or an address write prior to the read access can be 
made to ensure that the most recent data is read. 


When the host performs an external access to the HPI, there are two distinctly 
different types of cycles that can occur: those for which wait states are gener- 
ated (the HRDY signal is active) and those without wait states. In general, 
when in shared-access mode (SAM), the HRDY signal is used; when in host- 
only mode (HOM), HRDY is not active and remains high; however, there are 
exceptions to this, which will be discussed. 


For accesses utilizing the HRDY signal, during the time when the internal por- 
tion of the transfer is being performed (either for a read or a write), HRDY is 
low, indicating that another transfer cannot yet be initiated. Once the internal 
cycle is completed and another external cycle can begin, HRDY is driven high 
by the HPI. This occurs after a fixed delay following a cycle initiation (refer to 
the 54x data sheet for detailed timing information for HPI external interface 
timings). Therefore, unless back-to-back cycles are being performed, HRDY 
is normally high when the first byte of a cycle is transferred. The external HPI 
cycle using HRDY is shown in the timing diagram in Figure 8-13. 
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Figure 8-13. HPI Timing Diagram 


| Byte 1 | Byte 2 | 


HCNTLO/1 SXXXXXXn j AXXXKXXXKXN . COCOCOOCCOON 
MRI SSRSARDK Val RIIAAAIK Vil RAAAA RAED 


HBL OOD) LEQ OY RY 
ee XX XXX XXX) 
ns SOON 
HAS sf a a 
(ifused) YW. — JS YW. — —_/J/ YL. 


HD XXX) : XXX) : 
fo ————__ vaio. >» QR" vaia_)___—_— 
HD 


In a typical external access, as shown in Figure 8—13, the cycle begins with 
the host driving HCNTLO/1, HR/W, HBIL, and HCS, indicating specifically what 
type of transfer is to occur and whether the cycle is to be read or a write. Then 
the host asserts the HAS signal (if used) followed by one of the data strobe 
signals. If HRDY is not already high, it goes high when the previous internal 
cycle is complete, allowing data to be transferred, and the control signals are 
de-asserted. Following the external HPI cycle, HRDY goes low and stays low 
for a period of approximately five CLKOUT cycles (refer to the 54x data sheet 
for HPI timing information) while the ‘54x completes the internal HPI memory 
access, and then HRDY is driven high again. Note, however, HRDY is always 
high when HGS is high. 


As mentioned previously, SAM accesses generally utilize the HRDY signal. 
The exception to the HRDY-based interface timings when in SAM occurs when 
reading HPIC or HPIA or writing to HPIC (except when writing 1 to either 
DSPINT or HINT). In these cases, HRDY stays high; for all other SAM 
accesses, HRDY is active. 
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Host access cycles, when in HOM, have timings different from the SAM tim- 
ings described previously. In HOM, the CPU is not involved (with one excep- 
tion), and the access can be completed after a short, fixed delay time. The ex- 
ception to this occurs when writing 1s to the DSPINT or HINT bits in HPIC. In 
this case, the host access takes several CPU clock cycles, and SAM timings 
apply. Besides the HRDY timings and a faster cycle time, HOM access cycles 
are logically the same as SAM access cycles. A summary of the conditions un- 
der which the HRDY signal is active (where SAM timings apply) for host ac- 
cesses Is shown in Table 8—25. When HRDY is not active (HRDY stays high), 
HOM timings apply. Refer to the ‘54x data sheet for detailed HPI timing specifi- 
cations. 


Table 8-25. Wait-State Generation Conditions 


Access Sequences 


Wait State Generated 
Register Reads Writes 
HPIC No 1 to DSPINT/HINT — Yes 


All other cycles — No 


HPIA No HOM — No 
SAM — Yes 

HPID HOM — No HOM — No 
SAM — Yes SAM — Yes 


A complete host access cycle always involves two bytes, the first with HBIL 
low, and the second with HBIL high. This 2-byte sequence must be followed 
regardless of the type of host access (HPIA, HPIC, or data access) and the 
host must not break the first byte/second byte (HBIL low/high) sequence of an 
ongoing HPI access. If this sequence is broken, data may be lost, and an un- 
predictable operation may result. 


Before accessing data, the host must first initialize HPIC, in particular the BOB 
bit, and then HPIA (in this order, because BOB affects the HPIA access). After 
initializing BOB, the host can then write to HPIA with the correct byte align- 
ment. On an HPI memory read operation, after completion of the HPIA write, 
the HPI memory is read and the contents at the given address are transferred 
to the two 8-bit data latches, the first byte data latch and the second byte data 
latch. Table 8-26 illustrates the sequence involved in initializing BOB and 
HPIA for an HPI memory read. In this example, BOB is set to 0 and a read is 
requested of the first HPI memory location (in this case 1000h), which contains 
FFFEh. 
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Table 8-26. Initialization of BOB and HPIA 


Event HD HR/W~ HCNTL1/0 HBIL HPIC- HPIA _latch1 latch2 
Host writes HPIC, 1st byte 00 0 00 0 00xx XXXX XXXX XXXX 
Host writes HPIC, 2nd byte 00 0 00 1 0000 XXXX XXXX XXXX 
Host writes HPIA, 1st byte 10 0 10 0 0000 10xx XXXX XXXX 
Host writes HPIA, 2nd byte 00 0 10 1 1000 XXXX XXXX 
Internal HPI RAM read complete 1000 FF FIE 


In the cycle shown in Table 8—26, BOB and HPIA are initialized, and by loading 
HPIA, an internal HPI memory access is initiated. The last line of Table 8-26 
shows the condition of the HPI after the internal RAM read is complete; that 
is, after some delay following the end of the host write of the second byte to 
HPIA, the read is completed and the data has been placed in the upper and 
lower byte data latches. For the host to actually retrieve this data, it must per- 
form an additional read of HPID. During this HPID read access, the contents 
of the first byte data latch appears on the HD pins when HBIL is low and the 
content of the second byte data latch appears on the HD pins when HBIL is 
high. Then the address is incremented if autoincrement is selected and the 
memory is read again into the data latches. The sequence involved in this 
access is shown in Table 8-27. 


Table 8-27. Read Access to HPI With Autoincrement 


Event HD HR/W- HCNTLi1/0 HBIL- HPIC_ HPIA_latch1 latch2 
Host reads data, 1st byte [AF i 01 0 0000 1000 FF FE 
Host reads data, 2nd byte FE 1 01 1 0000 1000 FE FE 
Internal HPI RAM read complete 1001 6A BC 
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In the access shown in Table 8—27, the data obtained from reading HPID is the 
data from the read initiated in the previous cycle (the one shown in Table 8-26) 
and the access performed as shownin Table 8-27 also initiates a further read, 
this time at location 1001h (because autoincrement was specified in this ac- 
cess by setting HCNTL1/0 to 01). Also, when autoincrement is selected, the 
increment occurs with each 16-bit word transferred (not with each byte); there- 
fore, as shown in Table 8—27, the HPIA is incremented by only 1. The last line 
of Table 8—27 indicates that after the second internal RAM read is complete, 
the contents of location 1001h (6ABCh) has been read and placed into the up- 
per and lower byte data latches. 
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During a write access to the HPI, the first byte data latch is overwritten by the 
data coming from the host while the HBIL pin is low, and the second byte data 
latch is overwritten by the data coming from the host while the HBIL pin is high. 
At the end of this write access, the data in both data latches is transferred as 
a 16-bit word to the HPI memory at the address specified by the HPIA register. 
The address is incremented prior to the memory write because autoincrement 
is selected. 


An HPI write access is illustrated in Table 8—28. In this example, after the inter- 
nal portion of the write is completed, location 1002h of HPI RAM contains 
1234h. Ifa read of the same address follows this write, the same data just writ- 
ten in the data latches (1234h) is read back. 


Table 8-28. Write Access to HPI With Autoincrement 


Event HD HR/W~ HCNTL1/0 HBIL- HPIC- HPIA _latch1 latch2 
Host writes data, 1st byte 12 0 01 0 0000 1002 12 FE 
Host writes data, 2nd byte 34 0 01 1 0000 1002 12 34 
Internal HP! RAM write complete 1002 12 34 


8.6.4 DSPINT and HINT Function Operation 


The host and the ’54x can interrupt each other using bits in the HPIC register. 
This section presents more information about this process. 


Host Device Using DSPINT to Interrupt the 54x 


A ’54x interrupt is generated when the host writes a 1 to the DSPINT bit in 
HPIC. This interrupt can be used to wake up the ’54x from IDLE. The host and 
the ’54x always read this bit as 0. A’54x write has no effect. Once a 1 is written 
to DSPINT by the host, a 0 need not be written before another interrupt can 
be generated, and writing a 0 to this bit has no effect. The host should not write 
a1 to the DSPINT bit while writing to BOB or HINT, or an unwanted ’54x inter- 
rupt is generated. 


On the ’54x, the host-to-’'54x interrupt vector address is xx64h. This interrupt 
is located in bit 9 of the IMR/IFR. Since the ’54x interrupt vectors can be 
remapped into the HPI memory, the host can instruct the 54x to execute 
preprogrammed functions by simply writing the start address of a function to 
address xx65h in the HPI memory prior to interrupting the 54x with a branch 
instruction located at address xx64h. If the interrupts are remapped to the host 
port accessible on-chip RAM, you must use SAM and the host must not write 
to location xx00h to xx7Fh, except for xx65h. 
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Host Port Interface (54x) Using HINT to Interrupt the Host Device 


When the 54x writes a 1 to the HINT bitin HPIC, the HINT output is driven low; 
the HINT bitis read as a 1 by the ’54x or the host. The HINT signal can be used 
to interrupt the host device. The host device, after detecting the HINT interrupt 
line, can acknowledge and clear the ’54x interrupt and the HINT bit by writing 
a 1 to the HINT bit. The HINT bit is cleared and then read as a 0 by the 54x 
or the host, and the HINT pin is driven high. If the 54x or the host writes a 0, 
the HINT bit remains unchanged. While accessing the SMOD bit, the 54x 
should not write a 1 to the HINT bit unless it also wants to interrupt the host. 


8.6.5 Considerations in Changing HPI Memory Access Mode (SAM/HOM) and 
IDLE2/3 Use 


The HPI host-only mode (HOM) allows the host to access HPI RAM while the 
’BAx is in IDLE2/3 (that is, completely halted). Additionally, the external clock 
input to the 54x can be stopped for the lowest power consumption configura- 
tion. Under these conditions, random accesses can still be made without hav- 
ing to restart the external clock for each access and wait for its lockup time if 
the 54x on-chip PLL is used. The external clock need only be restarted before 
taking the 54x out of IDLE2/3. 


The host cannot access HPI RAM in SAM when the ’54x is in IDLE2/3, be- 
cause CPU clocks are required for access in this mode of operation. There- 
fore, if the host requires access to the HP! RAM while the ’54x is in IDLE2/3, 
the ‘54x must change HPI mode to HOM before entering IDLE2/3. When the 
HPI is in HOM, the 54x can access HPIC to toggle the SMOD bit or send an 
interrupt to the host, but cannot access the HPI RAM block; a ’54x access to 
the HPI RAM is disregarded in HOM. In order for the 54x to again access the 
HPI RAM block, HPI mode must be changed to SAM after exiting IDLE2/3. 


To select HOM, a 0 must be written to the SMOD bit in HPIC. To select SAM, 
a1 must be written to SMOD. When changing between HOM and SAM, two 
considerations must be met for proper operation. First, the instruction immedi- 
ately following the one that changes from SAM to HOM must not be an IDLE 2 
or IDLE 3. This is because in this case, due to the 54x pipeline and delays in 
the SAM to HOM mode switch, the IDLE2/3 takes effect before the mode 
switch occurs, causing the HPI to remain in SAM; therefore, no host accesses 
can occur. 
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The second consideration is that when changing from HOM to SAM, the 
instruction immediately following the one that changes from HOM to SAM can- 
not read the HPI RAM block. This requirement is due to the fact that the mode 
has not yet changed when the HPI RAM read occurs and the RAM read is 
ignored because the mode switch has not yet occurred. HPI RAM writes are 
not included in this restriction because these operations occur much later in 
the pipeline, so itis possible to write to HPI RAM in the instruction following the 
one which changes from HOM to SAM. 


On the host side, there are no specific considerations associated with the 
mode changes. For example, it is possible to have a third device wake up the 
54x from IDLE2/3 and the ‘54x changing to SAM upon wake-up without a soft- 
ware handshake with the host. The host can continue accessing while the HPI 
mode changes. However, if the host accesses the HPI RAM while the mode 
is being changed, the actual mode change will be delayed until the host access 
is completed. In this case, a 54x access to the HPI memory is also delayed. 


Table 8—29 illustrates the sequence of events involved in entering and exiting 
an IDLE2/3 state on the ‘54x when using the HPI. Throughout the process, the 
HPI is accessible to the host. 


Table 8-29. Sequence for Entering and Exiting IDLE2 and IDLE3 


“Host or Other Device 54x —s—*=<“<«*‘“‘<«=C*#‘éMOe:~«S=« SAX lO 
Switches mode to HOM HOM Running 
Executes a NOP HOM Running 
Executes IDLE 2 or IDLE 3 HOM Running 
instruction 
May stop DSP clock In IDLE2/3 HOM Stopped 
or running 
Turns on DSP clock ifit In IDLE2/3 HOM Running 
was stoppedt 
Sends an interrupt to In IDLE2/3 HOM Running 
DSP 
54x wakes up from IDLE2/3 HOM Running 
54x switches mode to SAM SAM Running 


t Sufficient wake-up time must be ensured when the ’54x on-chip PLL is used. 
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8.6.6 Access of HPI Memory During Reset 


The ’54x is not operational during reset, but the host can access the HPI, allow- 
ing program or data downloads to the HPI memory. When this capability is 
used, it is often convenient for the host to control the ’54x reset input. The se- 
quence of events for resetting the ‘54x and downloading a program to HPI 
memory while the ’54x is in reset is summarized in Table 8-30 and corre- 
sponds to the reset of the ’54x. 


Initially, the host stops accessing the HPI at least six ‘54x periods before driv- 
ing the ’54x reset line low. The host then drives the 54x reset line low and can 
start accessing the HPI after a minimum of four ’54x periods. The HPI mode 
is automatically set to HOM during reset, allowing high-speed program down- 
load. The 54x clock can even be stopped at this time; however, the clock must 
be running when the reset line falls and rises for proper reset operation of the 
"BAX. 


Once the host has finished downloading into HP! memory, the host stops 
accessing the HPI and drives the ’54x reset line high. At least 20 ’54x periods 
after the reset line rising edge, the host can again begin accessing the HPI. 
This number of periods corresponds to the internal reset delay of the 54x. The 
HPI mode is automatically set to SAM upon exiting reset. 


If the host writes a 1 to DSPINT while the ’54x is in reset, the interrupt is lost 
when the ’54x comes out of reset. The 54x warm boot can use the HPI memory 
and start execution from the lowest HPI address. 


Table 8—30. HPI Operation During RESET 


Host *B4x Mode *B4x CLK 
Waits 6 ’54x clock periods Running X Running 
Brings RESET low and waits 4 clocks Goes into reset HOM Running 
Can stop ’54x clock In reset HOM Stopped or running 
Writes program and/or data in HP! memory In reset HOM Stopped or running 
Turns on DSP clock if it was stoppedt In reset HOM Running 
Brings RESET high In reset HOM Running 
Waits 20 54x clock periods Comes out of reset SAM Running 
Can access HPI Running SAM Running 


t Sufficient wake-up time must be ensured when the ’54x on-chip PLL is used. 
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Serial Ports 


This chapter discusses the four serial port interfaces connected to the 54x 
core CPU: 


(J Standard synchronous serial port interface 

(1 Buffered serial port interface 

[1 Multichannel buffered serial Port (McBSP) interface 
1 Time-division multiplexed serial port interface 


These peripherals are controlled through registers that reside in the memory 
map. The serial ports are synchronized to the core CPU by way of interrupts. 
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9:2" :Sotlal Port intertace:.s..cnc. ncaa ane sate eee nen eeeke 9-4 
9.3 Buffered Serial Port (BSP) Interface ............:0eeeeeeeeeeeee 9-33] 
9.4 Time-Division Multiplexed (TDM) Serial Port Interface ........... 


9-1 


Introduction to the Serial Ports 


9.1 


Introduction to the Serial Ports 


4x devices implement a variety of types of flexible serial port interfaces. 
These serial port interfaces provide full duplex, bidirectional, communication 
with serial devices such as codecs, serial analog to digital (A/D) converters, 
and other serial systems. The serial port interface signals are directly compat- 
ible with many industry-standard codecs and other serial devices. The serial 
port may also be used for interprocessor communication in multiprocessing 
applications (the time-division multiplexed (TDM) serial port is especially opti- 
mized for multiprocessing). 


Table 9-1 lists the serial ports available on various 54x devices. 


Table 9-1. Serial Ports on the TMS320C54x Devices 


Standard MultiChannel Time-Division 

Synchronous Buffered Buffered Multiplexed 
Device Serial Ports Serial Ports Serial Ports Serial Ports 
B41 2 0 0 0 
542 0 1 0 1 
543 0 1 0 1 
545 1 1 0 0 
546 1 1 0 0 
548 0 2 0 1 
549 0 2 0 1 
5402 0 0 2 0 
5410 0 0 3 0 
5420 0 0 6 0 


Table 9-2 lists the sections that should be consulted for the various serial ports 
and their modes. 


Table 9-2. Sections that Discuss the Serial Ports 


Serial Port Mode 

Standard = 

Buffered Autobuffering 
Nonbuffered 
(standard) 

MCBSP Multichannel 

TDM TDM 
Non-TDM 
(standard) 
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See... 


section 9.2, Standard Serial Port Interface, 
on page 9-4. 


section 9.3, Buffered Serial Port (BSP) 
Interface, on page 9-33. 


section 9.2, Standard Serial Port Interface, 
on page 9-4. 


TMS320C54x Enhanced Peripherals, Vol- 
ume 5 (literature number SPRU302). 


section 9.4, Time-Division Multiplexed 
(TDM) Serial Port Interface, on page 9-56. 


section 9.2, Standard Serial Port Interface, 
on page 9-4. 
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9.2 Serial Port Interface 
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Four different types of serial port interfaces are available on ‘54x devices. The 
basic standard serial port interface is implemented on 541, 545, and ’546 de- 
vices. The TDM serial port interface is implemented on the '542, ’543, ’548, 
and '549 devices. The 542, ’543, 545, ’546, 548, and 549 devices include 
a buffered serial port (BSP) that implements an automatic buffering feature, 
which greatly reduces CPU overhead required in handling serial data trans- 
fers. The 5402, 5410, and 5420 devices include multichannel buffered serial 
ports (McBSPs). See Table 9-1 for information about the features included in 
various ’54x devices. 


The BSP operates in either autobuffering or nonbuffered mode. When oper- 
ated in nonbuffered (or standard) mode, the BSP functions the same as the 
basic standard serial port (except where specifically indicated) and is 
described in this section. The TDM serial port operates in either TDM or non- 
TDM mode. When operated in non-TDM (or standard) mode, the TDM serial 
port also functions the same as the basic standard serial port and is described 
in this section. 


The BSP also implements several enhanced features in standard mode. 
These features, together with operation of the BSP in autobuffering mode, are 
described in section 9.3, Buffered Serial Port (BSP) Interface, on page 9-33. 
Therefore, when using the 542, 543, 545, ’546, 548, and ’549 devices, you 
should consult section 9.3. 


Operation of the TDM serial port in TDM mode is described in section 9.4, 
Time-Division Multiplexed (TDM) Serial Port Interface, on page 9-56. Note that 
the BSP and TDM serial ports initialize to a standard serial port compatible 
mode upon reset. 


In all 54x serial ports, both receive and transmit operations are double- 
buffered, thus allowing a continuous communications stream with either 8- or 
16-bit data packets. The continuous mode provides operation that, once initi- 
ated, requires no further frame synchronization pulses (FSR and FSX) when 
transmitting at maximum packet frequency. The serial ports are fully static and 
thus will function at arbitrarily low clocking frequencies. The maximum operat- 
ing frequency for the standard serial port of one-fourth of CLKOUT (10 Mbit/s 
at 25 ns, 12.5 Mbit/s at 20 ns) is achieved when using internal serial port 
clocks. The maximum operating frequency for the BSP is CLKOUT. When the 
serial ports are in reset, the device may be configured to turn off the internal 
serial port clocks, allowing the device to run in a lower power mode of opera- 
tion. 
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9.2.1 Serial Port Interface Registers 


The serial port operates through the three memory-mapped registers (SPC, 
DXR, and DRR) and two other registers (RSR and XSR) that are not directly 
accessible to the program, but are used in the implementation of the double- 
buffering capability. These five registers are listed in Table 9-3. 


Table 9-3. Serial Port Registers 


Address Register Description 
ili DRR Data receive register 
t DXR Data transmit register 
tT SRG Serial port control register 
—_— RSR Receive shift register 
—_— XSR Data transmit shift register 


t See section 8.2, Peripheral Memory-Mapped Registers. 


L 


Data receive register (DRR). The 16-bit memory-mapped data receive 
register (DRR) holds the incoming serial data from the RSR to be written 
to the data bus. At reset, the DRR is cleared. 


Data transmit register (DXR). The 16-bit memory-mapped data transmit 
register (DXR) holds the outgoing serial data from the data bus to be 
loaded in the XSR. At reset, the DXR is cleared. 


Serial port control register (SPC). The 16-bit memory-mapped serial port 
control register (SPC) contains the mode control and status bits of the seri- 
al port. 


Data receive shift register (RSR). The 16-bit data receive shift register 
(RSR) holds the incoming serial data from the serial data receive (DR) pin 
and controls the transfer of the data to the DRR. 


Data transmit shift register (XSR). The 16-bit data transmit shift register 
(XSR) controls the transfer of the outgoing data from the DXR and holds 
the data to be transmitted on the serial data transmit (DX) pin. 


During normal serial port operation, the DXR is typically loaded with data to 
be transmitted on the serial port by the executing program, and its contents 
read automatically by the serial port logic to be sent out when a transmission 
is initiated. The DRR is loaded automatically by the serial port logic with data 
received on the serial port and read by the executing program to retrieve the 
received data. 
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At times during normal serial port operation, however, it may be desirable for 
a program to perform other operations with the memory-mapped serial port 
registers besides simply writing to DXR and reading from DRR. 


On the SP, the DXR and DRR may be read or written at any time regardless 
of whether the serial port is in reset or not. On the BSP, access to these regis- 
ters is restricted; the DRR can only be read, and the DXR can only be written 
when autobuffering is disabled (see section 9.3.2, Autobuffering Unit (ABU) 
Operation, on page 9-40). The DRR can only be written when the BSP is in 
reset. The DXR can be read at any time. 


Note, however, that on both the SP and the BSP, care should be exercised 
when reading or writing to these registers during normal operation. With the 
DRR, since, as mentioned previously, this register is written automatically by 
the serial port logic when data is received, if a write to DRR is performed, sub- 
sequent reads may not yield the result written if a serial port receive occurs 
after the write but before the read is performed. With the DXR, care should be 
exercised when this register is written, since if previously written contents in- 
tended for transmission have not yet been sent, these contents will be over- 
written and the original data lost. As mentioned previously, the DXR can be 
read at any time. 


Alternatively, DXR and DRR may also serve as general purpose storage if they 
are not required for serial port use. If these registers are to be used for general 
purpose storage, the transmit and/or receive sections of the serial port should 
be disabled either by tying off (by pulling up or down, whichever is appropriate) 
external input pins which could spuriously cause serial port transfers, or by 
putting the port in reset. 


9.2.2 Serial Port Interface Operation 
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This section describes operation of the basic standard serial port interface, 
which includes operation of the TDM and BSP serial ports when configured in 
standard mode. Table 9-4 lists the pins used in serial port operation. 
Figure 9-1 shows these pins for two ’54x serial ports connected for a one-way 
transfer from device 0 to device 1. Only three signals are required to connect 
from a serial port transmitter to a receiver for data transmission. The trans- 
mitted serial data signal (DX) sends the actual data. The transmit frame syn- 
chronization signal (FSX) initiates the transfer (at the beginning of the packet), 
and the transmit clock signal (CLKX) clocks the bit transfer. The corresponding 
pins on the receive device are DR, FSR and CLKR, respectively. 
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Table 9-4. Serial Port Pins 


Pin Description 

CLKR Receive clock signal 

CLKX Transmit clock signal 

DR Received serial data signal 

DX Transmitted serial data signal 

FSR Receive framing synchronization signal 
FSX Transmit frame synchronization signal 


Figure 9-1. One-Way Serial Port Transfer 


54x device 0 ’B4x device 1 
DX > DR 
> 
> 


Figure 9-2 shows how the pins and registers are configured in the serial port 
logic and how the double-buffering is implemented. 


Transmit data is written to the DXR, while received data is read from the DRR. 
A transmit is initiated by writing data to the DXR, which copies the data to the 
XSR when the XSR is empty (when the last word has been transmitted serially, 
that is, driven on the DX pin). The XSR manages shifting the data to the DX 
pin, thus allowing another write to DXR as soon as the DXR-to-XSR copy is 
completed. 


During transmits, upon completion of the DXR-to-XSR copy, a 0-to-1 transition 
occurs on the transmit ready (XRDY) bit inthe SPC. This 0-to-1 transition gen- 
erates a serial port transmit interrupt (XINT) that signals that the DXR is ready 
to be reloaded. See section 6.10, /nterrupts, on page 6-26 for more information 
on ’54x interrupts. 


The process is similar in the receiver. Data from the DR pin is shifted into the 
RSR, which is then copied into the DRR from which it may be read. Upon 
completion of the RSR-to-DRR copy, a 0-to-1 transition occurs on the receive 
ready (RRDY) bit in the SPC. This 0-to-1 transition generates a serial port re- 
ceive interrupt (RINT). Thus, the serial port is double-buffered because data 
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can be transferred to or from DXR or DRR while another transmit or receive 
is being performed. Note that transfer timing is synchronized by the frame sync 
pulse in burst mode (discussed in more detail in section 9.2.4, Burst Mode 
Transmit and Receive Operations, on page 9-18). 


Figure 9-2. Serial Port Interface Block Diagram 


k 
16 16} 
v Load Y 
control 
logic DXR (16) 
RINT "6 us 
on XINT on 
RSR-DRR <—+ eae (Load) | +» DXR-XSR 
transfer Logic transfer 


@ 
(Clear) (Clear) 
Byte/word ’ Byte/word 
Vv 
FSR FSX 
Y v 
DR CLKR CLKX DX 


9.2.3 Configuring the Serial Port Interface 


The SPC contains control bits which configure the operation of the serial port. 
The SPC bit fields are shown in Figure 9-3 and described in Table 9—5. Note 
that seven bits in the SPC are read only and the remaining nine bits are 
read/write. 


Figure 9-3. Serial Port Control Register (SPC) Diagram 


R/W RW R R R R R R RAW RW RW RW RW RW RW R 
Note: R =Read, W = Write 
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Table 9-5. Serial Port Control Register (SPC) Bit Summary 


Bit Name 


15 Free 


14 Soft 


13. RSRFULL 


12 XSREMPTY 


Reset 


Value Function 


0 


This bit is used in conjunction with the Soft bit to determine the state of the serial 
port clock when a breakpoint is encountered in the HLL debugger. See Table 9-6 
on page 9-17 for the serial port clock configurations. 


Free =0 The Soft bit selects the emulation mode. 
Free =1 The serial port clock runs free regardless of the Soft bit. 


This bit is used in conjunction with the Free bit to determine the state of the serial 
port clock when a breakpoint is encountered in the HLL debugger. When the Free 
bit is cleared to 0, the Soft bit selects the emulation mode. See Table 9-6 on page 
9-17 for the serial port clock configurations. 


Soft = 0 The serial port clock stops immediately, thus aborting any 
transmission. 


Soft = 1 The clock stops after completion of the current transmission. 


Receive Shift Register Full. This bit indicates whether the receiver has experi- 
enced overrun. Overrun occurs when RSR is full and DRR has not been read since 
the last RSR-to-DRR transfer. On the SP, when FSM = 1, the occurrence of a frame 
sync pulse on FSR qualifies the generation of RSRFULL = 1. When FSM =0, and 
on the BSP, only the basic two conditions apply; that is, RSRFULL goes high with- 
out waiting for an FSR pulse. 


RSRFULL = 0 Any one of the following three events clears the RSRFULL 
bit to 0: reading DRR, resetting the receiver (RRST bit to 0), 
or resetting the device. 


RSRFULL = 1 The port has recognized an overrun. When RSRFULL = 1, 
the receiver halts and waits for DRR to be read, and any data 
sent on DR is lost. On the SP, the data in RSR is preserved; 
on the BSP, the contents of RSR are lost. 


Transmit Shift Register Empty. This bit indicates whether the transmitter has expe- 
rienced underflow. Underflow occurs when XSR is empty and DXR has not been 
loaded since the last DXR-to-XSR transfer. 


XSREMPTY = 0 Any one of the following three events clears the XSREMPTY 
bit to 0: underflow has occurred, resetting the transmitter 
(XRST bit to 0), or resetting the device. 


XSREMPTY = 1 On the SP, XSREMPTY is deactivated (set to 1) directly as 
a result of writing to DXR; on the BSP, XSREMPTY is only 
deactivated after DXR is loaded followed by the occurrence 
of an FSX pulse. 
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Table 9-5. Serial Port Control Register (SPC) Bit Summary (Continued) 


Bit Name 
11. XRDY 
10 RRDY 
9 IN1 

8 INO 

7 RRST 
6 XRST 
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Reset 
Value 


{ 


Function 


Transmit Ready. A transition from 0 to 1 of the XRDY bit indicates that the DXR 
contents have been copied to XSR and that DXR is ready to be loaded with a new 
data word. A transmit interrupt (XINT) is generated upon the transition. This bit can 
be polled in software instead of using serial port interrupts. Note that on the SP, 
XRDY is generated directly as a result of writing to DXR; while on the BSP, XRDY 
is only generated after DXR is loaded followed by the occurrence of an FSX pulse. 
At reset or serial port transmitter reset (XRST = 0), the XRDY bit is set to 1. 


Receive Ready. A transition from 0 to 1 of the RRDY bit indicates that the RSR con- 
tents have been copied to the DRR and that the data can be read. A receive inter- 
rupt (RINT) is generated upon the transition. This bit can be polled in software 
instead of using serial port interrupts. At reset or serial port receiver reset 
(RRST = 0), the RRDY bit is cleared to 0. 


Input 1. This bit allows the CLKX pin to be used as a bit input. IN1 reflects the cur- 
rent level of the CLKX pin of the device. When CLKX switches levels, there is a 
latency of between 0.5 and 1.5 CLKOUT cycles before the new CLKX value is rep- 
resented in the SPC. 


Input 0. This bit allows the CLKR pin to be used as a bit input. INO reflects the cur- 
rent level of the CLKR pin of the device. When CLKR switches levels, there is a 
latency of between 0.5 and 1.5 CLKOUT cycles before the new CLKR value is rep- 
resented in the SPC. 


Receive Reset. This signal resets and enables the receiver. When a 0 is written 
to the RRST bit, activity in the receiver halts. 


RRST = 0 The serial port receiver is reset. Writing a 0 to RRST clears 
the RSRFULL and RRDY bits to 0. 


RRST = 1 The serial port receiver is enabled. 


Transmitter Reset. This signal is used to reset and enable the transmitter. When 
a 0 is written to the XRST bit, activity in the transmitter halts. When the XRDY bit 
is 0, writing a0 to XRST generates a transmit interrupt (XINT). 


XRST = 0 The serial port transmitter is reset. Writing a 0 to XRST clears 
the XSREMPTY bit to 0 and sets the XRDY bit to 1. 


XRST = 1 The serial port transmitter is enabled. 
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Table 9-5. Serial Port Control Register (SPC) Bit Summary (Continued) 


Bit Name 
5 TXM 
4 MCM 
3 FSM 
2 |50) 


Reset 
Value 


0 


Function 


Transmit Mode. This bit configures the FSX pin as an input (TXM = 0) or as an out- 


put (TXM = 1). 
TXM = 0 
TXM = 1 


External frame sync. The transmitter idles until a frame 
sync pulse is supplied on the FSX pin. 


Internal frame sync. Frame sync pulses are generated in- 
ternally when data is transferred from the DXR to XSR to initi- 
ate data transfers. The internally generated framing signal is 
synchronous with respect to CLKX. 


Clock Mode. This bit specifies the clock source for CLKX. 


MCM = 0 


MCM = 1 


CLKX is taken from the CLKX pin. 


CLKX is driven by an on-chip clock source. For the SP and 
the BSP in standard mode, this on-chip clock source is at a 
frequency of one-fourth of CLKOUT. The BSP also allows 
the option of generating clock frequencies at additional ratios 
of CLKOUT. For a detailed description of this feature, see 
section 9.3, Buffered Serial Port (BSP) Interface, on page 
9-33. Note that if MCM = 1 and DLB = 1, a CLKR signal is 
also supplied by the internal source. 


Frame Sync Mode. This bit specifies whether frame synchronization pulses (FSX 
and FSR) are required after the initial frame sync pulse for serial port operation. 
See section 9.2.2, Serial Port Interface Operation, on page 9-6 for more details 
on the frame sync signals. 


FSM =0 


FSM = 1 


Continuous mode. Frame sync pulses are not required af- 
ter the initial frame sync pulse, but they are not ignored; 
therefore, improperly timed frame syncs may cause errors in 
serial transfers. See section 9.2.6, Serial Port Interface Ex- 
ception Conditions, on page 9-26 for information about serial 
port operation under various exception conditions. 


Burst mode. A frame sync pulse is required on FSX/FSR 
for the transmission/reception of each word. 


Format. This bit specifies the word length of the serial port transmitter and receiver. 


BORO 


FO =1 


The data is transmitted and/or received as 16-bit words. 


The data is transferred as 8-bit bytes. The data is transferred 
with the MSB first. The BSP also allows the capability of 10- 
and 12-bit transfers. For a detailed description of this feature, 
see section 9.3, Buffered Serial Port (BSP) Interface, on 
page 9-33. 
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Table 9-5. Serial Port Control Register (SPC) Bit Summary (Continued) 


Bit Name 
1 DLB 


OQ Res 


Reserved Bit 


DLB Bit 
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Reset 
Value 


0 


0 


Function 


Digital Loopback Mode. This bit can be used to put the serial portin digital loopback 


mode. 


DLB =0 


DLB = 1 


The digital loopback mode is disabled. The DR, FSR, and 
CLKR signals are taken from their respective device pins. 


The digital loopback mode is enabled. The DR and FSR sig- 
nals are connected to DX and FSX, respectively, through 
multiplexers, as shown in Figure 9—4(a) and (b) on page 
9-13. Additionally, CLKR is driven by CLKX if MCM = 1. If 
DLB = 1 and MCM = 0, CLKR is taken from the CLKR pin of 
the device. This configuration allows CLKX and CLKR to be 
tied together externally and supplied by a common external 
clock source. The logic diagram for CLKR is shown in 
Figure 9—4(c) on page 9-13. Note also that in DLB mode, the 
FSX and DX signals appear on the device pins, but FSR and 
DR do not. Either internal or external FSX signals may be 
used in DLB mode, as defined by the TXM bit. 


Reserved. Always read as a 0 in the serial port. This bit performs a function in 


the TDM serial port discussed in section 9.4, Time-Division-Multiplexed (TDM) Se- 


rial Port Interface, on page 9-56. 


Bit 0 is reserved and is read as 0, although it performs a function in the TDM 
serial port (discussed in section 9.4, Time-Division-Multiplexed (TDM) Serial 
Port Interface, on page 9-56). 


The DLB (bit 1) selects digital loopback mode, which allows testing of serial 
port code with a single ’54x device. When DLB = 1, DR and FSR are connected 
to DX and FSX, respectively, through multiplexers, as shown in Figure 9-4. 


When in loopback mode, CLKR is driven by CLKX if on-chip serial port clock 
generation is selected (MCM = 1), but if MCM = 0, then CLKR is driven by the 
external CLKR signal. This allows for the capability of external serial port clock 
generation in digital loopback mode. If DLB = 0, then normal operation occurs 
where DR, FSR, and CLKR are all taken from their respective pins. 
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Figure 9-4. Receiver Signal Multiplexers 


FO Bit 
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CF a 
MCM 


CLKX 


The FO (bit 2) specifies whether data is transmitted as 16-bit words (FO = 0) 
or 8-bit bytes (FO = 1). Note that in the latter case, only the lower byte of what- 
ever is written to DXR is transmitted, and the lower byte of data read from DRR 
is what was received. To transmit a whole 16-bit word in 8-bit mode, two writes 
to DXR are necessary, with the appropriate shifts of the value because the up- 
per eight bits written to DXR are ignored. Similarly, to receive a whole 16-bit 
word in 8-bit mode, two reads from DRR are required, with the appropriate 
shifts of the value. In the SP, the upper eight bits of DRR are indeterminate in 
8-bit receptions; in the BSP, the unused bits of DRR are sign-extended. Addi- 
tionally, inthe BSP, transfers of 10- and 12-bit words are provided for additional 
flexibility. For a detailed description of this feature, refer to section 9.3, Buff- 
ered Serial Port (BSP) Interface, on page 9-33. 


The FSM (bit 3) specifies whether or not frame sync pulses are required in 
consecutive serial port transmits. If FSM = 1, a frame sync must be present for 
every transfer, although FSX may be either externally or internally generated 
depending on TXM. This mode is referred to as burst mode, because there are 
normally periods of inactivity on the serial port between transmits. 


The frequency with which serial port transmissions occur is called packet fre- 
quency, and data packets can be 8, 10, 12, or 16 bits long. Therefore, as pack- 
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MCM Bit 


TXM Bit 


et frequency increases, it reaches a maximum that occurs when the time, in 
serial port clock cycles, from one packet to the next, is equal to the number of 
bits being transferred. If transmission occurs at the maximum rate for multiple 
transfers in a row, however, frame sync essentially becomes redundant. Note 
that frame sync actually becomes redundant in burst mode only at maximum 
packet frequency with FSX configured as an output (TXM = 1). When FSX is 
an input (TXM = 0), its presence is required for transmissions to occur. 


FSM = 0 selects the continuous mode of operation which requires only an ini- 
tial frame sync pulse as long as a write to DXR (for transmit), or a read from 
DRR (for receive), is executed during each transfer. Note that when FSM = 0, 
frame sync pulses are not required, but they are not ignored, therefore, im- 
properly timed frame syncs may cause errors in serial transfers. The timing of 
burst and continuous modes is discussed in detail in sections 9.2.4, 9.2.5, and 
9.2.6. 


The serial port clock source is set by MCM (bit 4). If MCM = 0, CLKX is config- 
ured as an input and thus accepts an external clock. If MCM = 1, then CLKX 
is configured as an output, and is driven by an internal clock source. For the 
SP, and the BSP operating in standard mode, this on-chip clock is at a frequen- 
cy of one-fourth of CLKOUT. The BSP also allows the option of generating 
clock frequencies at additional ratios of CLKOUT. For a detailed description 
of this feature, refer to section 9.3, Buffered Serial Port (BSP) Interface, on 
page 9-33. Note that the CLKR pin is always configured as an input. 


The transmit frame synchronization pulse source is set by TXM (bit 5). Like 
MCM, if TXM = 1, FSX is configured as an output and generates a pulse at the 
beginning of every transmit. If TXM = 0, FSX is configured as an input, and ac- 
cepts an external frame sync signal. Note that the FSR pin is always config- 
ured as an input. 


XRST and RRST Bits 
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The serial port transmitter and receiver are reset with XRST (bit 6) and RRST 
(bit 7). These signals are active low, so that if XRST = RRST =0, the serial port 
is ina reset state. To reset and reconfigure the serial port, a total of two writes 
to the SPC are required. 


C1 The first write to the SPC should: 
m write a0 tothe XRST and RRST bits 
™ write the desired configuration to the remainder of the bits. 


INO and IN1 Bits 
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(1 The second write to the SPC should: 
H write 1 to the XRST and RRST bits 
mM resend the desired configuration to the remainder of the bits. 


The second write takes the serial port out of reset. Note that the transmitter and 
receiver may be reset individually if desired. When a 0 is written to XRST or 
RRST, activity in the corresponding section of the serial port stops. This mini- 
mizes the switching and allows the device to operate with lower power con- 
sumption. When XRST = RRST = MCM = 0, power requirements are further 
reduced since CLKX is no longer driven as an output. 


In IDLE2 and IDLE3 mode, SP operation halts as with other parts of the ’54x 
device. On the BSP, however, if the external serial port clock is being used, op- 
eration continues after an IDLE2/3 is executed. This allows power savings to 
still be realized in IDLE2/3, while still maintaining operation of critical serial port 
functions if necessary (see section 9.3, Buffered Serial Port (BSP) Interface, 
on page 9-33 for further information about BSP operation). 


It should also be noted that, on the SP, the serial port may be taken out of reset 
at any time. Depending on the timing of exiting reset, however, a frame sync 
pulse may be missed. On the BSP, for receive and transmit with external frame 
sync, a setup of at least one CLKOUT cycle plus 1/2 serial port clock cycle is 
required prior to FSX being sampled active in standard mode. In autobuffering 
mode, additional setup is required (see section 9.3, Buffered Serial Port (BSP) 
Interface, on page 9-33 for further information about BSP initialization timing 
requirements). 


INO (bit 8) and IN1 (bit 9) allow the CLKR and CLKX pins to be used as bit in- 
puts. INO and IN1 reflect the current states of the CLKR and CLKX pins. The 
data on these pins can be sampled by reading the SPC. This can be 
accomplished using the BIT, BITF, BITT, or CMPM instruction. Note that there 
is a latency of between 0.5 and 1.5 CLKOUT cycles in duration from CLKR/ 
CLKX switching to the new CLKR/CLKX value being available in the SPC. 
Note that even if the serial port is reset, INO and IN1 can still be used as bit 
inputs, and DRR and DXR as general-purpose registers. 


RRDY and XRDY Bits 


Bits 10—13 in the SPC are read-only status bits that indicate various states of 
serial port operation. Writes and reads of the serial port may be synchronized 
by polling RRDY (bit 10) and XRDY (bit 11), or by using the interrupts that they 
generate. A transition from 0 to 1 of the RRDY bit indicates that the RSR con- 
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XSREMPTY Bit 


RSRFULL Bit 
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tents have been copied to the DRR and that the received data may be read. 
A receive interrupt (RINT) is generated upon this transition. 


A transition from 0 to 1 of the XRDY bit indicates that the DXR contents have 
been copied to XSR and that DXR is ready to be loaded with a new data word. 
Atransmit interrupt (XINT) is generated upon this transition. Polling XRDY and 
RRDY in software may either substitute for or complement the use of serial 
port interrupts (both polling and interrupts may be used together if so desired). 
Note that with external FSX, on the SP, XSR is loaded directly as a result of 
loading DXR, while on the BSP, XSR is not loaded until an FSX occurs. 


The XSREMPTY (bit 12) indicates whether the transmitter has experienced 
underflow. XSREMPTY is an active low bit; therefore, when XSREMPTY = 0, 
an underflow has occurred. 


Any one of the following three conditions causes XSREMPTY to become 
active (XSREMPTY = 0): 


(1 DXR has not been loaded since the last DXR-to-XSR transfer, and XSR 
empties (the actual transition of XSREMPTY occurs after the last bit has 
been shifted out of XSR), 


Lj or the transmitter is reset (XRST = 0), 


1 or the 54x device is reset (RS = 0). 


When XSREMPTY = 0, the transmitter halts and stops driving DX (the DX pin 
is ina high-impedance state) until the next frame sync pulse. Note that under- 
flow does not constitute an error condition in the burst mode, although it does 
in the continuous mode (error conditions are further discussed in section 9.2.6, 
Serial Port Interface Exception Conditions, on page 9-26). 


The following condition causes XSREMPTY to become inactive (XSREMPTY = 1): 


(1 Awrite to DXR occurs on the SP, or on the BSP a write to DXR occurs fol- 
lowed by an FSX pulse (see section 9.2.4, Burst Mode Transmit and Re- 
ceive Operations, on page 9-18 for further information about transmit 
timing). 


The RSRFULL (bit 13) indicates whether the receiver has experienced over- 
run. RSRFULLis an active high bit; therefore, wnen RSRFULL = 1, RSRis full. 


In burst mode (FSM = 1), all three of the following must occur to cause 
RSRFULL to become active (RSRFULL = 1): 
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1 The DRR has not been read since the last RSR-to-DRR transfer, 
Li RSR is full, 
1 and a frame sync pulse appears on FSR. 


In continuous mode (FSM = 0), and on the BSP, only the first two conditions 
are necessary to set RSRFULL: 


1 The DRR has not been read since the last RSR-to-DRR transfer 
LJ and RSR is full. 


Therefore, in continuous mode, and on the BSP, RSRFULL occurs after the 
last bit has been received. 


When RSRFULL = 1, the receiver halts and waits for the DRR to be read, and 
any data sent on DR is lost. On the SP, the data in RSR is preserved; on the 
BSP, the RSR contents are lost. 


Any one of the following three conditions causes RSRFULL to become inactive 
(RSRFULL = 0): 


(J The DRR is read, 
(J or the serial port is reset (RRST = 0), 
OV or the 54x device is reset (RS = 0). 


SOFT and FREE Bits 


Soft (bit 14) and Free (bit 15) are special emulation bits that determine the state 
of the serial port clock when a breakpoint is encountered in the high-level 
language (HLL) debugger. If the Free bit is set to 1, then upon a software 
breakpoint, the clock continues to run (free runs) and data is still shifted out. 
When Free = 1, the Soft bit is a don’t care. If the Free bit is cleared to 0, then 
the Soft bit takes effect. If the Soft bit is cleared to 0, then the clock stops 
immediately, thus aborting any transmission. If the Soft bit is set to 1 anda 
transmission is in progress, the transmission continues until completion of the 
transfer, and then the clock halts. These options are listed in Table 9-6. 


The receive side functions in a similar fashion. Note that if an option other than 
immediate stop (Soft = Free = 0) is chosen, the receiver continues running and 
an overflow error is possible. The default value for these bits is immediate stop. 


Table 9-6. Serial Port Clock Configuration 
Free _— Soft Serial Port Clock Configuration 
0 0 Immediate stop, clocks are stopped. (Reset values) 


0 1 Transmitter stops after completion of the current word. The 
receiver is not affected. 


1 Xx Free run. 


Note: X=Don'tcare 
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9.2.4 Burst Mode Transmit and Receive Operations 


In burst mode operation, there are periods of serial port inactivity between 
packet transmits. The data packet is marked by the frame sync pulse occurring 
on FSX (see Figure 9-5). On the transmit device, the transfer is initiated by a 
write to DXR. The value in DXR is then transferred to XSR, and, upon a frame 
sync pulse on FSX (generated internally or externally depending on TXM), the 
value in XSR is shifted out and driven on the DX pin. Note that on the SP, the 
DXR to XSR transfer occurs on the second rising edge of CLKX after DXR is 
loaded, while on the BSP this transfer does not occur until an FSX occurs, 
when FSX is external. When FSX is internal on the BSP, the DXR to XSR trans- 
fer and generation of FSX occur directly after loading DXR. On both the SP and 
the BSP, once XSR is loaded with the value from DXR, XRDY goes high, gen- 
erating a transmit interrupt (XINT) and setting XSREMPTY to a 1. 


Figure 9-5. Burst Mode Serial Port Transmit Operation 
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Note that in both the SP and the BSP, DXR to XSR transfers occur only if the 
XSR is empty and the DXR has been loaded since the last DXR to XSR trans- 
fer. lf DXR is reloaded before the old DXR contents have been transferred to 
XSR, the previous DXR contents are overwritten. Accordingly, unless overwrit- 
ing DXR is intended, the DXR should only be loaded if XRDY = 1. This is 
assured if DXR writes are made only in response to a transmit interrupt or 
polling XRDY. 
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It should be noted that in the following discussions, the timings are slightly dif- 
ferent for internally (TXM = 1, FSX is an output) and externally (TXM = 0, FSX 
is aninput) generated frame syncs. This distinction is made because in the for- 
mer case, the frame sync pulse is generated by the transmitting device as a 
direct result of a write to DXR. In the latter case, there is no such direct effect. 
Instead, the transmitting device must write to DXR and wait for an externally 
generated frame sync. 


If internal frame sync pulse generation is selected (TXM = 1), a frame sync 
pulse is generated on the second rising edge of CLKX following awrite to DXR. 
For externally generated frame syncs, the events described here will occur as 
soon as a properly timed frame sync pulse occurs (see the data sheet for 
detailed serial port interface timings). 


On the next rising edge of CLKX after FSX goes high, the first data bit (MSB 
first) is driven onthe DX pin. Thus, ifthe frame sync pulse is generated internal- 
ly (TXM = 1), there is a 2-CLKX cycle latency (approximately) after DXR is 
loaded, before the data is driven on the line. If frame sync is externally gener- 
ated, data transmission is delayed indefinitely after a DXR load until the FSX 
pulse occurs (this is described in further detail later in this section). With the 
falling edge of frame sync, the rest of the bits are shifted out. When all the bits 
are transferred, DX enters a high-impedance state. 


Atthe end of each transmission, if DXR was not reloaded when XINT was gen- 
erated, XSREMPTY becomes active (low) at this point, indicating underflow. 
With externally generated frame sync, if XSREMPTY is active and a frame 
sync pulse is generated, any old data in the DXR is transmitted. This is 
explained in detail in section 9.2.6, Serial Port Interface Exception Conditions, 
on page 9-26. 


Note that the first data bit transferred could have variable length if frame sync 
is generated externally and does not fall within one CLKX cycle (this is illus- 


trated in Figure 9-6). Internally generated frame syncs are assured by ’54x 
timings to be one CLKX cycle in duration. 


Figure 9-6. Serial Port Transmit With Long FSX Pulse 
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Serial port transmit with external frame sync pulses is similar to that with inter- 
nal frame sync, with the exception that transfers do not actually begin until the 
external frame sync occurs. If the external frame sync occurs many CLKX 
cycles after DXR is loaded, however, the double buffer is filled and frozen until 
frame sync appears. 


On the SP (Figure 9—7), when the delayed frame sync occurs, Ais transmitted 
on DX; after the transmit, a DXR-to-XSR copy of B occurs, XINT is generated, 
and again, the transmitter remains frozen until the next frame sync. When 
frame sync finally occurs, B is transmitted on DX. Note that when B is loaded 
into DXR, a DXR-to-XSR copy of B does not occur immediately because A has 
not been transmitted, and no XINT is generated. Any subsequent writes to 
DXR before the next delayed frame sync occurs overwrite B in the DXR. 


Figure 9—7. Burst Mode Serial Port Transmit Operation With Delayed Frame Sync 
in External Frame Sync Mode (SP) 
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On the BSP (Figure 9-8), since DXR was reloaded with B shortly after being 
loaded with A when the delayed frame sync finally occurs, B is transmitted on 
DX. After the transmit, the transmitter remains frozen until the next frame sync. 
When frame sync finally occurs, B is again transmitted on DX. Note that when 
B is loaded into DXR, a DXR-to-XSR copy of B does not occur immediately 
since the BSP requires a frame sync to initiate transmitting. Any subsequent 
writes to DXR before the next delayed frame sync occurs overwrite B in the 
DXR. 
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Figure 9-8. Burst Mode Serial Port Transmit Operation With Delayed Frame Sync 
in External Frame Sync Mode (BSP) 
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During a receive operation, shifting into RSR begins on the falling edge of the 
CLKR cycle after frame sync has gone low (as shown in Figure 9-9). Then, 
as the last data bit is being received, the contents of the RSR are transferred 
to the DRR on the falling edge of CLKR, and RRDY goes high, generating a 
receive interrupt (RINT). 


Figure 9-9. Burst Mode Serial Port Receive Operation 
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If the DRR from a previous receive has not been read, and another word is 
received, no more bits can be accepted without causing data corruption since 
DRR and RSR are both full. In this case, the RSRFULL bit is set indicating this 
condition. On the SP, this occurs with the next FSR; on the BSP, RSRFULL is 
set on the falling edge of CLKR during the last bit received. RSRFULL timing 
on both the SP and BSP is shown in Figure 9-10. 
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Figure 9-10. Burst Mode Serial Port Receive Overrun 


aK AP) ARARRALRARRARPRRIN 


MSB 


= 


RINT / \ 


RSRFULL(SP) f \ 


RSRFULL(BSP) aS. f | eee 
———_—_----- 


9-22 


DRR loaded Read A 
with A from DRR 


Unlike transmit underflow, overrun (RSRFULL = 1) constitutes an actual error 
condition. While DRR contents are preserved in overrun, its occurrence can 
often result in loss of other received data. 


Overrun is handled differently on the SP and on the BSP. On the SP, the con- 
tents of RSR are preserved on overrun, but since RSRFULL is not set to 1 until 
the next FSR occurs after the overflowing reception, incoming data usually 
begins being lost as soon as RSRFULL is set. Data loss can only be avoided 
if RSRFULL is polled in software and the DRR is read immediately after 
RSRFULL is set to 1. This is normally possible only if the CLKR frequency is 
slow with respect to CLKOUT, since RSRFULL is set on the falling edge of 
CLKR during FSR, and data begins being received on the following rising edge 
of CLKR. The time available for polling RSRFULL and reading the DRR to 
avoid data loss is, therefore, only half of one CLKR cycle. 


On the BSP, RSRFULL is set on the last valid bit received, but the contents of 
RSR are never transferred to DRR, therefore, the complete transferred word 
in RSR is lost. If the DRR is read (clearing RSRFULL) before the next FSR 
occurs, subsequent transfers can be received properly. 


Overrun and various other serial port exception conditions such as the occur- 
rence of frame sync during a receive are discussed in further detail in section 
9.2.6, Serial Port Interface Exception Conditions, on page 9-26. 


If the serial port receiver is provided with FSR pulses significantly longer than 
one CLKR cycle, timing of data reception is effected in a similar fashion as with 
long FSX pulses. With long FSR pulses, however, the reception of all bits, 
including the first one, is simply delayed until FSR goes low. Serial port receive 
operation with a long FSR pulse is illustrated in Figure 9-11. 
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Figure 9-11. Serial Port Receive With Long FSR Pulse 


Note that if the packet transmit frequency is increased, the inactivity period 
between the data packets for adjacent transfers decreases to zero. This corre- 
sponds to a minimum period between frame sync pulses (equivalent to 8 or 
16 CLKX/R cycles, depending on FO) that corresponds to a maximum packet 
frequency at which the serial port may operate. At maximum packet frequency, 
transmit timing is a compressed version of Figure 9-5, as shown in 
Figure 9-12. 


Figure 9-12. Burst Mode Serial Port Transmit at Maximum Packet Frequency 
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At maximum packet frequency, the data bits in consecutive packets are trans- 
mitted contiguously with no inactivity between bits. The frame sync pulse over- 
laps the last bit transmitted in the previous packet. Maximum packet frequency 
receive timing is similar and is shown in Figure 9-13. 
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Figure 9-13. Burst Mode Serial Port Receive at Maximum Packet Frequency 


I ac + I th a 
oade joade read 
from RSR read from RSR 


As shown in Figure 9-12 and Figure 9-13, with the transfer of multiple data 
packets at maximum packet frequency in burst mode, packets are transmitted 
at a constant rate, and the serial port clock provides sufficient timing informa- 
tion for the transfer, which permits a continuous stream of data. Therefore, the 
frame sync pulses are essentially redundant. Theoretically, then, only an initial 
frame sync signal is required to initiate the multipacket transfer. The 54x does 
support operation of the serial port in this fashion, referred to as continuous 
mode, which is selected by clearing the FSM bit in the SPC to 0. Continuous 
mode serial port operation is described in detail in section 9.2.5, Continuous 
Mode Transmit and Receive Operations. 


9.2.5 Continuous Mode Transmit and Receive Operations 


In continuous mode, a frame sync on FSX/FSR is not necessary for consecu- 
tive packet transfers at maximum packet frequency after the initial pulse. Con- 
tinuous mode is selected by setting FSM = 0. Note that when FSM = 0, frame 
sync pulses are not required, but they are not ignored, therefore, improperly 
timed frame syncs may cause errors in serial transfers. Serial port operation 
under various error conditions is described in detail in section 9.2.6, Serial Port 
Interface Exception Conditions, on page 9-26. 


In continuous mode transmission, one frame sync is generated following the 
first DXR load, and no further frame syncs are generated. As long as DXR is 
reloaded once every transmission, continuous transfers are maintained. Fail- 
ing to update DXR causes the serial port to halt, as in the burst mode case 
(XSREMPTY becomes asserted, etc). If DXR is reloaded after a halt, the 
device begins continuous mode transmission again and generates a single 
FSX, assuming that internal frame sync generation is selected. 
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The distinction between internal and external frame syncs for continuous 
mode is similar to that of burst mode, as discussed in section 9.2.4, Burst Mode 
Transmit and Receive Operations, on page 9-18. If frame sync is externally 
generated (TXM = 0), then when DXR is loaded, the appearance of the frame 
sync pulse initiates continuous mode transmission. Continuous mode trans- 
mission may be discontinued (and burst mode resumed) only by reconfiguring 
and resetting the serial port (see section 9.2.2, Serial Port Interface Operation, 
on page 9-6). Simply changing the FSM bit during transmit or halt will not 
properly switch to burst mode. 


Continuous mode transmit timing, shown in Figure 9-14, is similar to maxi- 
mum packet frequency transmission in burst mode as shown in Figure 9-12. 
The major difference is the lack of a frame sync pulse after the initial one. As 
long as DXR is updated once per transmission, this mode will continue. Over- 
writes to DXR behave just as in burst mode; the last data written will be 
transmitted. XSR operation is the same as in burst mode. A new external FSX 
pulse will abort the present transmission, cause one data packet to be lost, and 
initiate a new continuous mode transmit. This is explained in more detail in sec- 
tion 9.2.6, Serial Port Interface Exception Conditions, on page 9-26. 


Figure 9-14. Continuous Mode Serial Port Transmit 
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Continuous mode reception is similar to the transmit operation. After the initial 
frame sync pulse on FSR, no further frame syncs are required. This mode will 
continue as long as DRR is read every transmission. If DRR is not read by the 
end of the next transfer, the receiver will halt, and RSRFULL is set, indicating 
overrun. See section 9.2.6, Serial Port Interface Exception Conditions. 
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Overrun in continuous mode effects the SP and the BSP differently. On the SP, 
once overrun has occurred, reading DRR will restart continuous mode at the 
next word/byte boundary after DRR is read; no new FSR pulse is required. On 
the BSP, continuous mode reception does not resume until DRR is read and 
an FSR occurs. 


Continuous mode reception may only be discontinued by reconfiguring and 
resetting the serial port. Simply changing the FSM bit during a reception or halt 
will not properly switch to burst mode. Continuous mode receive timing is 
shown in Figure 9-15. 


Figure 9-15. Continuous Mode Serial Port Receive 
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Figure 9-15 shows only one frame sync pulse; otherwise, it is similar to 
Figure 9-13. If a pulse occurs on FSR during a transfer (an error), then the 
receive operation is aborted, one packet is lost, and a new receive cycle is 
begun. This is discussed in more detail in section 9.2.2, Serial Port Interface 
Operation, on page 9-6 and in section 9.2.6, Serial Port Interface Exception 
Conditions. 


9.2.6 Serial Port Interface Exception Conditions 
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Exception (or error) conditions result from an unexpected event occurring on 
the serial port. These conditions are operational aberrations such as overrun, 
underflow, or a frame sync pulse during a transfer. Understanding how the 
serial port handles these errors and the state it acquires during these error 
conditions is important for efficient use of the serial port. Because the error 
conditions differ slightly in burst and continuous modes, they are discussed 
separately. 
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Burst Mode 


In burst mode, one type of error condition (presented in section 9.2.2, Serial 
Port Interface Operation) is receive overrun, indicated by the RSRFULL flag. 
This flag is set when the device has not read incoming data and more data is 
being sent. If this condition occurs, the processor halts serial port receives until 
DRR is read. Thus, any further data sent may be lost. 


Overrun is handled differently on the SP and on the BSP. On the SP, the con- 
tents of RSR are preserved on overrun, but since RSRFULL is not set to 1 until 
the next FSR occurs after the overflowing reception, incoming data usually 
begins being lost as soon as RSRFULL is set. Data loss can only be avoided 
if RSRFULL is polled in software and the DRR is read immediately after 
RSRFULL is set to 1. This is normally possible only if the CLKR frequency is 
slow with respect to CLKOUT, since RSRFULL is set on the falling edge of 
CLKR during FSR, and data begins being received on the following rising edge 
of CLKR. The time available for polling RSRFULL and reading the DRR to 
avoid data loss is, therefore, only half of one CLKR cycle. 


On the BSP, RSRFULL is set on the last valid bit received, but the contents of 
RSR are never transferred to DRR, therefore, the complete transferred word 
in RSR is lost. If the DRR is read (clearing RSRFULL) before the next FSR 
occurs, subsequent transfers can be received properly. 


Another type of receive error is caused if frame sync occurs during a receive 
(that is, data is being shifted into RSR from DR). If this happens, the present 
receive is aborted and anew one begins. Thus, the data that was being loaded 
into RSR is lost, but the data in DRR is not (no RSR-to-DRR copy occurs). 
Burst mode serial port receiver behavior under normal and error conditions for 
the SP is shown in Figure 9-16 and for the BSP is shown in Figure 9-17. 


Figure 9-16. SP Receiver Functional Operation (Burst Mode) 
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Figure 9-17. BSP Receiver Functional Operation (Burst Mode) 
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Transmitter exception conditions in burst mode may occur for several possible 
reasons. Underflow, which is described in section 9.2.3, Configuring the Serial 
Port Interface, on page 9-8 is an exception condition that may occur in burst 
mode, however, underflow is not normally considered an error. An exception 
condition that causes errors in transmitted data occurs when frame sync 
pulses occur at inappropriate times during a transfer. If a transmit is in progress 
(that is, XSR data is being driven on DX) when a frame sync pulse occurs, the 
transmission is aborted, and the data in XSR is lost. Then, whatever data is 
in DXR at the time of the frame sync pulse is transferred to XSR (DXR-to-XSR 
copy) and is transmitted. Note, however, that in this case an XINT is generated 
only if the DXR has been written to since the last transmit. Also, if XSREMPTY 
is active and a frame sync pulse occurs, the old data in DXR is shifted out. 
Figure 9-18 summarizes serial port transmit behavior under error and noner- 
ror conditions. Note that if an FSX occurs when no transmit is in progress, and 
DXR has been reloaded since the last transmit, the DXR-to-XSR copy and 
generation of transmit interrupt occur at this point only on the BSP. On the SP, 
these two events occur at the time the DXR was reloaded. 
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Figure 9-18. SP/BSP Transmitter Functional Operation (Burst Mode) 
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Continuous Mode 


In continuous mode, errors take on a broader meaning, since data transfer is 
intended to occur at all times. Thus, underflow (KSREMPTY = 0) constitutes 
an error in continuous mode because data will not be transmitted. As in burst 
mode, overrun (RSRFULL = 1) is also an error, and in continuous mode, both 
overrun and underflow cause the serial port receive or transmit sections, re- 
spectively, to halt (see section 9.2.3, Configuring the Serial Port Interface, on 
page 9-8 for a description of these conditions). Fortunately, underflow and 
overrun errors may not be catastrophic; they can often be corrected simply by 
reading DRR or writing to DXR. 


The SP and the BSP are affected differently when overrun occurs in continu- 
ous mode. Inthe SP, when DRR is read to deactivate RSRFULL, a frame sync 
pulse is not required in order to resume continuous mode operation. The 
receiver keeps track of the transfer word boundary, even though it is not 
receiving data. Therefore, when the RSRFULL flag is deactivated by a read 
from DRR, the receiver begins reading from the correct bit. On the BSP, since 
an FSR pulse is required to restart continuous reception, this also reesta- 
blishes the proper bit alignment, in addition to restarting reception. 
Figure 9-19 shows receiver functional operation in continuous mode. 
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Figure 9-19. SP/BSP Receiver Functional Operation (Continuous Mode) 
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During a receive in continuous mode, if a frame sync pulse occurs, this causes 
areceive abort condition, and one packet of data is lost (this is caused because 
the frame sync pulse resets the RSR bit counter). The data present on DR then 
begins being shifted into RSR, starting again from the first bit. Note that if a 
frame sync occurs after deactivating the RSRFULL flag by reading DRR, but 
before the beginning of the next word boundary, this also creates a receive 
abort condition. 


Another cause for error is the appearance of extraneous frame syncs during 
a transmission. After the initial frame sync in continuous mode, no others are 
required; if an improperly timed frame sync pulse occurs during a transmit, the 
current transfer (thatis, serially driving XSR data onto DX) is aborted, anddata 
in XSR is lost. A new transmit cycle is initiated, and transfers continue as long 
as the DXR is updated once per transmission afterward. Figure 9-20 shows 
continuous mode transmitter functional operation. 


Note that if XSREMPTY is active in continuous mode and an external frame 
sync occurs, the previous DXR data is transmitted as in burst mode operation. 
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Figure 9-20. SP/BSP Transmitter Functional Operation (Continuous Mode) 
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9.2.7 Example of Serial Port Interface Operation 


As anillustration of the proper operation of a standard serial port, Example 9-1 
and Example 9-2 define a sequence of actions. This illustration is based on 
the use of interrupts to handle the normal I/O between the serial port and CPU. 


The ’545 peripheral configuration has been used as a reference for these 
examples. 
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Example 9-1. Serial Port Initialization Routine 


Action 


Reset and initialize the serial port by 
writing 0038h (or 0008h) to SPC. 


Clear any pending serial port inter- 
rupts by writing OOCOh to IFR. 


Enable the serial port interrupts by 
ORing 00COh with IMR. 


Enable interrupts globally (if neces- 
sary) by clearing the INTM bitin ST1. 


Start the serial port by writing OOF8h 
(or 0OC8h) to SPC. 


Write the first data value to DXR. (If 
the serial port is connected to the se- 
rial port of another processor and 
this processor will be generating 
FSX, a handshake must be per- 
formed prior to writing the first data 
value to DXR.) 


Description 


This places both the transmit and receive portions of the serial port 


in reset and sets up the serial port to operate with internally gener- 
ated FSX and CLKX signals and FSX/FSR required for transmit/ 
receive of each 16-bit word. (The alternative is used if another de- 
vice will provide FSX and CLKX.) 


Eliminate any interrupts that may have occurred before initializa- 
tion. 


Enable both transmit and receive interrupts. A common alterna- 
tive when transmit and receive are synchronized to one another 
is to enable only one or the other, by ORing 0080h or 0040h with 
IMR, and performing both I/O operations with the same interrupt 
service routine (ISR). 


Interrupts must be globally enabled for the CPU to respond. 


This takes both the transmit and receive portions of the serial port 
out of reset and starts operations with the conditions defined in 
step 1. 


This initiates serial port transmit operations if FSX and CLKX are 
internally generated or prepares the serial port transmit for opera- 
tion when the first FSX arrives. 


Example 9-2. Serial Port Interrupt Service Routine 
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Action 


Save any context that may be modi- 
fied on the stack. 


Read the DRR or write the DXR or 
both. The data read form DRR 
should be written to a predetermined 
location in memory. The data written 
to DXR should be read from a prede- 
termined location in memory. 


Restore the context that was saved 
in step 1. 


Return from the ISR with an RETE to 
reenable interrupts. 


Description 


The operating context of the interrupted code must be maintained. 


Read the received data for the receive ISR. Write the new transmit 
data for the transmit ISR. Or, do both if the ISR is combined for 
transmit and receive. 


The operating context of the interrupted code must be maintained. 


Interrupts must be reenabled for the CPU to respond to the next 
interrupt. 
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9.3 Buffered Serial Port (BSP) Interface 


The buffered serial port (BSP) is made up of a full-duplex, double-buffered 
serial port interface, which functions in a similar manner to the 54x standard 
serial port, and an autobuffering unit (ABU) (see Figure 9-21). The serial port 
section of the BSP is an enhanced version of the ’54x standard serial port. The 
ABU is an additional section of logic which allows the serial port section to 
read/write directly to 54x internal memory independent of the CPU. This 
results in a minimum overhead for serial port transfers and faster data rates. 


The full duplex BSP serial interface provides direct communication with serial 
devices such as codecs, serial A/D converters, and other serial devices with 
a minimum of external hardware. The double-buffered BSP allows transfer of 
a continuous communication stream in 8-,10-,12- or 16-bit data packets. 
Frame synchronization pulses as well as a programmable frequency serial 
clock can be provided by the BSP for transmission and reception. The polarity 
of frame sync and clock signals are also programmable. The maximum operat- 
ing frequency is CLKOUT (40 Mbit/s at 25 ns, 50 Mbit/s at 30 ns). The BSP 
transmit section includes a pulse code modulation (PCM) mode that allows 
easy interface with a PCM line. Operation of the BSP in standard (nonbuffered) 
mode is detailed in section 9.3.1 on page 9-35. 


The ABU has its own set of circular addressing registers, each with corre- 
sponding address generation units. Memory for transmit and receive buffers 
resides within a special 2K word block of 54x internal memory. This memory 
can also be used by the CPU as general purpose storage, however, this is the 
only memory block in which autobuffering can occur. 


Using autobuffering, word transfers occur directly between the serial port 
section and the ’54x internal memory automatically using the ABU embedded 
address generators. The length and starting addresses of the buffers within 
the 2K block are programmable, and a buffer empty/full interrupt can be gener- 
ated to the CPU. Buffering can easily be halted using the autodisabling capa- 
bility. ABU operation is detailed in section 9.3.2 on page 9-40. 


The BSP autobuffering capability can be separately enabled for the transmit 
and receive sections. When autobuffering is disabled (standard mode), data 
transfers with the serial port section occur under software control in the same 
fashion as with the standard ’54x serial port. In this mode, the ABU is transpar- 
ent, and the WXINT and WRINT interrupts generated each time a word is 
transmitted or received are sent to the CPU as transmit interrupt (BXINT) and 
receive interrupt (BRINT). When autobuffering is enabled, the BXINT and 
BRINT interrupts are only generated to the CPU each time half of the buffer 
is transferred. 
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Figure 9-21. BSP Block Diagram 
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As mentioned previously, most aspects of BSP operation are similar to that of 
the ’54x standard serial port. section 9.2, Serial Port Interface, on page 9-4 
discusses operation of both the ’54x standard serial port and the BSP in 
standard mode. Since standard mode BSP operation is a superset of standard 
serial port operation, section 9.2, Serial Port Interface, should first be studied 
before the rest of this section is read. 


System considerations of BSP operation such as initialization and low power 
modes are discussed in section 9.3.3 on page 9-49. 
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9.3.1 BSP Operation in Standard Mode 


BSP operation in standard mode is discussed in section 9.2, Serial Port Inter- 
face, on page 9-4. This section summarizes the differences between serial 
port operation and standard mode BSP operation and describes the enhanced 
features that the BSP offers. The enhanced BSP features are available both 
in standard mode and in autobuffering mode. ABU is discussed in section 9.3.2 
on page 9-40. Information presented in this section assumes familiarity with 
standard mode operation as described in section 9.2, Serial Port Interface. 


The BSP uses its own dedicated memory-mapped data transmit, data receive 
and serial port control registers (BDXR, BDRR, and BSPC). The BSP also uti- 
lizes an additional control register, the BSP control extension register 
(BSPCE), inimplementing its enhanced features and controlling the ABU. The 
BDRR, BDXR, and BSPC registers function similarly to their counterparts in 
the serial port as described in section 9.2, Serial Port Interface. As with the se- 
rial port, the BSP transmit and receive shift registers (BXSR and BRSR) are 
not directly accessible in software but facilitate the double-buffering capability. 
If the serial port is not being used, the BDXR and the BDRR registers can be 
used as general purpose registers. In this case, BFSR should be set to an inac- 
tive state to prevent a possible receive operation from being initiated. Note, 
however, that program access to BDXR or BDRR is limited when autobuffering 
is enabled for transmit or receive, respectively. BDRR can only be read, and 
BDXR can only be written when the ABU is disabled. BDRR can only be written 
when the BSP is in reset. BDXR can be read any time. 


The buffered serial port registers are summarized in Table 9-7. The ABU 


utilizes several additional registers which are discussed in section 9.3.2, Auto- 
buffering Unit (ABU) Operation, on page 9-40. 


Table 9-7. Buffered Serial Port Registers 


Address Register Description 


ili BDRR 16-bit BSP data receive register 

t BDXR 16-bit BSP data transmit register 

t BSPC 16-bit BSP control register 

Ti BSPCE — 16-bit BSP control extension register 


— BRSR 16-bit BSP data receive shift register 
— BXSR 16-bit BSP data transmit shift register 


t See section 8.2, Peripheral Memory-Mapped Registers. 
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9.3.1.1 Differences Between Serial Port and BSP Operation in Standard Mode 


The differences between serial port and BSP operation in standard mode are 
discussed in detail in the standard mode serial port operation (section 9.2 on 
page 9-4). These differences relate primarily to boundary conditions, howev- 
er, in some systems, these differences may be significant. The differences are 
summarized in Table 9-8. 


Table 9-8. Differences Between Serial Port and BSP Operation in Standard 


Mode 


Condition 
RSRFULL is set. 


Preservation of data in RSR on 
overrun. 


Continuous mode receive restart 
after overrun. 


Sign extension in DRR on 8-, 10-, 
or 12-bit transfers. 


XSR load, XSREMPTY clear, 
XRDY/XINT generation. 


Program accessibility to DXR and 
DRR. 


Maximum serial port clock rate. 
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Serial Port 


RSRFULL is set when RSR is full 
and then an FSR occurs, except in 
continuous mode where RSRFULL 
is set as soon as RSR is full. 


RSR contents are preserved on 
overrun. 


Receive restarts as soon as DRR 
is read (see section 9.2.6, Serial 
Port Interface Exception Condi- 
tions, on page 9-26). 


No 


Occur when DXR is loaded. 


DRR and DXR can be read or writ- 
ten under program control at any 
time. Note that caution should be 
exercised when reads and writes of 
the DRR may be close in time to 
serial port receptions. In this case, 
a DRR read may not yield the 
result that was previously written 
by the program. Also note that 
rewrites of DXR may cause loss 
(and therefore non-transmission) of 
previously written data depending 
on the relative timing of the writes 
and FSX (see section 9.2.4, Burst 
Mode Transmit and Receive Op- 
erations, on page 9-18). 


CLKOUT/4 


BSP 


RSRFULL is set as soon as BRSR 
is full. 


BRSR contents are not preserved 
on overrun. 


Receive does not restart until 
BDRR is read and then a BFSR 
occurs. 


Yes 


Occur when when a BFSX occurs 
after BDXR is loaded. 


BDRR can only be read and BDXR 
can only be written when the ABU 
is disabled. BDRR can only be 
written when the BSP is in reset. 
BDXR can be read any time. The 
same precautions with regard to 
reads and writes to these registers 
apply as in serial port. 


CLKOUT 
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Table 9-8. Differences Between Serial Port and BSP Operation in Standard 


Mode (Continued) 
Condition Serial Port BSP 
Initialization timing requirements. On the serial port, the serial port On the BSP, exiting serial port 
may be taken out of reset at any reset under certain conditions must 
time with respect to FSX/FSR, precede FSX timing by one 


however, if XRST/RRST go high CLKOUT cycle in standard mode 

during or after the frame sync, the and by six CLKOUT cycles in 

frame sync may be ignored. autobuffering mode (see section 
9.3.3, System Considerations of 
BSP Operation, on page 9-49). 


Operates in IDLE2/3 mode. No Yes (see section 9.3.3, System 
Considerations of BSP Operation, 
on page 9-49). 


9.3.1.2 Enhanced BSP Features 


The enhanced features that the BSP offers include the capability to generate 
programmable rate serial port clocks, select positive or negative polarities for 
clock and frame sync signals, and to perform transfers of 10- and 12-bit words, 
in addition to the 8- and 16-bit transfers offered by the serial port. Additionally, 
the BSP implements the capability to specify that frame sync signals be 
ignored until instructed otherwise, and provides a dedicated operating mode 
which facilitates its use with PCM interfaces. 


The BSPCE contains the control and status bits that are used in the imple- 
mentation of these enhanced BSP features and the ABU. The 10 LSBs of 
BSPCE are dedicated to the enhanced features control, whereas the 6 MSBs 
are used for ABU control, which is discussed in section 9.3.2, Autobuffering 
Unit (ABU) Operation, on page 9-40. Figure 9-22 shows the BSPCE bit posi- 
tions and Table 9-9 summarizes the function of the BSPCE bits. The value of 
the BSPCE upon reset is 3. This results in standard mode operation compat- 
ible with the serial port. 


Figure 9-22. BSP Control Extension Register (BSPCE) Diagram — Serial Port Control 


Bits 
15-10 4-0 
ABU control CLKP CLKDV 


R/(W R/W R/W- R/W R/W R/W 
Note: R= Read, W = Write 
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Table 9-9. BSP Control Extension Register (BSPCE) Bit Summary — 
Serial Port Control Bits 
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Reset 
Name value 


ABU —_ 
control 


PCM 0 


FIG 0 


FE 0 


CLKP 0 


FSP 0 


Function 


Reserved for autobuffering unit control (see section 9.3.2, Autobuffering Unit 
(ABU) Operation, on page 9-40). 


Pulse Code Modulation Mode. This control bit puts the serial port in pulse code 
modulation (PCM) mode. The PCM mode only affects the transmitter. BDXR-to- 
BXSR transfer is not affected by the PCM bit value. 


PCM =0 Pulse code modulation mode is disabled. 


PCM = 1 Pulse code modulation mode is enabled. In PCM mode, BDXR is 
transmitted only if its most significant (215) bit is set to 0. If this bit 
is set to 1, BDXR is not transmitted and BDX is put in high imped- 
ance during the transmission period. 


Frame Ignore. This control bit operates only in transmit continuous mode with 
external frame and in receive continuous mode. 


FIG =0 Frame sync pulses following the first frame pulse restart the trans- 
fer. 


FIG = 1 Frame sync pulses following the first frame pulse that initiates a 
transfer operation are ignored. 


Format Extension. The FE bit in conjunction with FO in SPC (section 9.2.3, Setting 
the Serial Port Configuration, on page 9-8) specifies the word length. When 
FO FE = 00, the format is 16-bit words; when FO FE = 01, the format is 10-bit 
words; when FO FE = 10, the format is 8-bit words; and when FO FE = 11, the for- 
mat is 12-bit words. Note that for 8-, 10-, and 12-bit words, the received words are 
right justified and the sign bit is extended to form a 16-bit word. Words to transmit 
must be right justified. See Table 9-10 for the word length configurations. 


Clock Polarity. This control bit specifies when the data is sampled by the receiver 
and transmitter. 


CLKP =0 Data is sampled by the receiver on BCLKR falling edge and sent 
by the transmitter on BCLKX rising edge. 


CLKP = 1 Data is sampled by the receiver on BCLKR rising edge and sent 
by the transmitter on BCLKX falling edge. 


Frame Sync Polarity. This control bit specifies whether frame sync pulses (BFSX 
and BFSR) are active high or low. 


FSP =0 Frame sync pulses (BFSX and BFSR) are active high. 


FSP = 1 Frame sync pulses (BFSX and BFSR) are active low. 
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Table 9-9. BSP Control Extension Register (BSPCE) Bit Summary — 
Serial Port Control Bits (Continued) 


Reset 
Bit Name value Function 


4-0 CLKDV 00011 Internal Transmit Clock Division factor. When the MCM bit of BSPC is set to 1, 
CLKxX is driven by an on-chip source having a frequency equal to 1/(CLKDV+1) 
of CLKOUT. CLKDV range is 0-31. When CLKDV is odd or equal to 0, the CLKX 
duty cycle is 50%. When CLKDV is an even value (CLKDV=2p), the CLKX high 
and low state durations depend on CLKP. When CLKP is 0, the high state duration 
is p+1 cycles and the low state duration is p cycles; when CLKP is 1, the high state 
duration is p cycles and the low state duration is p+1 cycles. 


Table 9-10. Buffered Serial Port Word Length Configuration 


FO FE Buffered Serial Port Word Length Configuration 


0 0 16-bit words transmitted and received. (Reset values) 
0 1 10-bit words transmitted and received. 

1 0 8-bit words transmitted and received. 

1 1 12-bit words transmitted and received. 


These enhanced features allow greater flexibility in serial port interface in a 
variety of areas. In particular, the frame ignore feature offers a capability which 
allows a mechanism for effectively compressing transferred data packets if 
they are not transferred in 16 bit format. This feature is used with continuous 
receptions and continuous transmits with external frame sync. When FIG = 0, 
if a frame sync pulse occurs after the initial one, the transfer is restarted; when 
FIG = 1, this frame sync is ignored. Setting FIG to 1 allows, for example, effec- 
tively achieving continuous 16-bit transfers under circumstances where frame 
sync pulses occur every 8-, 10- or 12-bits. Without using FIG, each transfer 
of less than 16 bits requires an entire 16-bit memory word, and each 16 bits 
transferred as two 8-bit bytes requires two memory words and two transfer 
operations, rather than one of each. Using FIG, therefore, can result in a signif- 
icant improvement in buffer size requirement in both autobuffered and 
standard mode, and a significant improvement in CPU cycle overhead 
required to handle serial port transfers in standard mode. Figure 9-23 shows 
an example with the BSP configured in 16-bit format but with a frame sync after 
8 bits. 
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Figure 9-23. Transmit Continuous Mode with External Frame and FIG = 1 
(Format Is 16 Bits) 
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9.3.2 Autobuffering Unit (ABU) Operation 


Since ABU functionality is a superset of standard mode serial port operation, 
the following sections should be read prior to reading this section: section 9.2, 
Serial Port Interface, on page 9-4, and section 9.3.1, BSP Operation in Stan- 
dard Mode, on page 9-35 Also, you should note that when operating in auto- 
buffering mode, the serial port control and status bits in BSPC and BSPCE 
function in the same fashion as in standard mode. 


The ABU implements the capability to move data transferred on the serial port 
to and from internal 54x memory independent of CPU intervention. 


The ABU utilizes five memory-mapped registers: the address transmit register 
(AXR), the block size transmit register (BKX), the address receive register 
(ARR), and the block size receive register (BKR), along with the BSPCE. 
These registers are summarized in Table 9-11. 


Table 9-11. Autobuffering Unit Registers 


Address Register Description 


iT BSPCE 16-bit BSP control extension register 

Tr AXR 11-bit BSP address transmit register (ABU) 

T BKX 11-bit BSP transmit buffer size register (ABU) 
T ARR 11-bit BSP address receive register (ABU) 

Ti BKR 11-bit BSP receive buffer size register (ABU) 


t See section 8.2, Peripheral Memory-Mapped Registers. 
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Figure 9-24 shows the block diagram of the ABU. The BSPCE contains bits 
which control ABU operation and will be discussed in detail later in this section. 
AXR, BKX, ARR, and BKR, along with their associated circular addressing log- 
ic, allow address generation for accessing words to be transferred between 
the 54x internal memory and the BSP data transmit register (BDXR) and BSP 
data receive register (BDRR) in autobuffering mode. The address and block 
size registers as well as circular addressing are also discussed in detail later 
in this section. 


Note that the 11-bit memory mapped AXR, BKX, ARR, and BKR registers are 
read as 16-bit words, with the five most significant bits read as zeroes and the 
11-bit register contents right justified in the least significant 11 bits. If autobuf- 
fering is not used, these registers can be used for general purpose storage of 
11-bit data. 


The transmit and receive sections of the ABU can be enabled separately. 
When either section is enabled, access to its corresponding serial port data 
register (BDXR or BDRR) through software is limited. The BDRR can only be 
read, and the BDXR can only be written when the ABU is disabled. The BDRR 
can only be written when the BSP is in reset. The BDXR can be read any time. 
When either transmit or receive autobuffering is disabled, that section oper- 
ates in standard mode, and its portion of the ABU is transparent. 


The ABU also implements CPU interrupts when transmit and receive buffers 
have been halfway or entirely filled or emptied. These interrupts take the place 
of the transmit and receive interrupts in standard mode operation (the receive 
interrupt is the CPU). They are not generated in autobuffering mode. This 
mechanism features an autodisabling capability that can be used to automati- 
cally terminate autobuffering when either the half-of-buffer or bottom-of-buffer 
boundary is crossed. These features are described in detail later in this sec- 
tion. 


Serial Ports 9-41 


Buffered Serial Port (BSP) Interface 


Figure 9-24. ABU Block Diagram 
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Burst or continuous mode, as described in section 9.2, Serial Port Interface, 
can be used in conjunction with the autobuffering capability. Note that due to 
the nature of autobuffering mode, however, if burst mode with internal frame 
syncis selected, this will effectively result in continuous transmission with FSX 
generated by the BSP at the start of each transmission. 
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The internal 54x memory used for autobuffering consists of a 2K-word block 
of dual-access memory that can be configured as data, program, or both (as 
with other dual-access memory blocks). This memory can also be used by the 
CPU as general purpose storage, however, this is the only memory block in 
which autobuffering can occur. Since the BSP is implemented on several dif- 
ferent TMS320 devices, the actual base address of the ABU memory may not 
be the same in all cases. The memory map for the particular device being used 
should be consulted for the actual base address of its ABU memory. 


When the ABU is enabled, this 2K-word block of memory can still be accessed 
by the CPU within data and/or program spaces. Conflicts may therefore occur 
between the CPU and the ABU if the 2K-word block is accessed at the same 
time by both. If a conflict does occur, priority is given to the ABU, resulting in 
the CPU access being delayed by one cycle. Accordingly, the worst case situa- 
tion is that a CPU access could be delayed one cycle each time the ABU 
accesses the memory block, that is, for every new word transmitted or 
received. Note that when on-chip program memory is secured using the ROM 
protection feature, the 2K-word block of ABU memory cannot be mapped to 
program memory. For further information regarding the ROM protection fea- 
ture, see section 3.5, Program and Data Security, on page 3-30. 


When the ABU is enabled for both transmit and receive, if transmit and receive 
requests from the serial port interface occur at same time, the transmit request 
takes priority over the receive request. In this case, the transmit memory 
access occurs first, delaying the receive memory access by generating a wait 
state. When the transmit memory access is completed, the receive memory 
access takes place. 


9.3.2.1 Autobuffering Control Register 


The most-significant six bits in the BSPCE constitute the ABU control register 
(ABUC). Some of these bits are read only, while others are read/write. 
Figure 9-25 shows the ABUC bit positions and Table 9-12 summarizes the 
function of each ABUC bit in the BSPCE. The value of the BSPCE upon reset 
is 3. 


Figure 9-25. BSP Control Extension Register (BSPCE) Diagram — ABU Control Bits 
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15 14 13 12 11 10 

HALTR HALTX Serial Port control 
R/W R R/W R/W R R/W 

Note: R= Read, W = Write 
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Table 9-12. BSP Control Extension Register (BSPCE) Bit Summary — 


Bit 
15 


14 


13 


12 
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Name 
HALTR 


RH 


BRE 


HALTX 


ABU Control Bits 
Reset 
value Function 
0 Autobuffering Receive Halt. This control bit determines whether autobuffering 
receive is halted when the current half of the buffer has been received. 
HALTR = 0 Autobuffering continues to operate when the current half of the 
buffer has been received. 
HALTR = 1 Autobuffering is halted when the current half of the buffer has 
been received. When this occurs, the BRE bit is cleared to 0 
and the serial port continues to operate in standard mode. 
0 Receive Buffer Half Received. This read-only bit indicates which half of the 
receive buffer has been filled. Reading RH when the RINT interrupt occurs 
(seen either as a program interrupt or by polling IFR) is a convenient way to 
identify which boundary has just been crossed. 
RH = 0 The first half of the buffer has been filled and that receptions 
are currently placing data in the second half of the buffer. 
RH = 1 The second half of the buffer has been filled and that recep- 
tions are currently placing data in the first half of the buffer. 
0 Autobuffering Receive Enable. This control bit enables autobuffering receive. 
BRE = 0 Autobuffering is disabled and the serial port interface operates 
in standard mode. 
BRE Autobuffering is enabled for the receiver. 
0 Autobuffering Transmit Halt. This control bit determines whether autobuffering 


transmit is halted when the current half of the buffer has been transmitted. 


HALTX = 0 Autobuffering continues to operate when the current half of the 
buffer has been transmitted. 


HALTX = 1 Autobuffering is halted when the current half of the buffer has 
been transmitted. When this occurs, the BXE bit is cleared to 
0 and the serial port continues to operate in standard mode. 
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Table 9-12. BSP Control Extension Register (BSPCE) Bit Summary — 
ABU Control Bits (Continued) 


Reset 
Bit Name value Function 
11. XH 0 Transmit Buffer Half Transmitted. This read-only bit indicates which half of the 
transmit buffer has been transmitted. Reading XH when the XINT interrupt oc- 
curs (seen either as a program interrupt or by polling IFR) is a convenient way 
to identify which boundary has just been crossed. 

XH =0 The first half of the buffer has been transmitted and transmis- 
sions are currently taking data from the second half of the 
buffer. 

XH = 1 The second half of the buffer has been transmitted and trans- 
missions are currently taking data from the first half of the 
buffer. 

10 BXE 0 Autobuffering Transmit Enable. This control bit enables the autobuffering trans- 
mit. 

BXE = 0 Autobuffering is disabled and the serial port operates in stan- 
dard mode. 

BXE = 1 Autobuffering is enabled for the transmitter. 

9-0 Serial Port — Serial Port Interface Control bits (see section 9.3.1.2, Enhanced BSP Features, 
control on page 9-37). 


9.3.2.2 Autobuffering Process 


The autobuffering process occurs between the ABU and the 2K-word block of 
ABU memory. Each time a serial port transfer occurs, the data involved is auto- 
matically transferred to or from a buffer in the 2K-word block of memory under 
control of the ABU. During serial port transfers in autobuffering mode, inter- 
rupts are not generated with each word transferred as they are in standard 
mode operation. This prevents the overhead of having the CPU directly 
involved in each serial port transfer. Interrupts are generated to the CPU only 
each time one of the half-buffer boundaries is crossed. 


Within the 2K-word block of ABU memory, the starting address and size of the 
buffers allocated is programmable using the 11-bit address registers (AXR and 
ARR) and the 11-bit block size registers (BKX and BKR). The transmit and 
receive buffers can reside in independent areas, overlapping areas or the 
same area, which allows transmitting from a buffer while receiving into the 
same buffer if desired. 
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The autobuffering process utilizes a circular addressing mechanism to access 
buffers within the 2K word block of ABU memory. This mechanism operates 
in the same fashion for transmit and receive. For each direction (transmit or 
receive), two registers specify the buffer size and the current address in the 
buffer. These registers are the block size and address register for transmit and 
receive (BKX, BKR, ARX, ARR, respectively). Each of the block size and 
address register pairs fully specify the top and bottom of buffer addresses for 
transmit and receive. Note that this circular addressing mechanism only 
effects accesses into the 2K-word block by the ABU. Accesses to this memory 
by the CPU are performed strictly according to the addressing mode(s) 
selected in the assembly language instructions which perform the memory 
access. 


The circular addressing mechanism automatically recirculates ABU memory 
accesses through the specified buffer, returning to the top of the buffer each 
time the bottom of the buffer is reached. The circular addressing mechanism 
is initialized by loading BKX/R with the exact size of the desired buffer (as 
opposed to size—1) and ARX/R with a value which contains both the base 
address of the buffer within the 2K word block and the initial starting address 
within this buffer (this is explained in detail below). Often the initial starting 
address within the buffer is 0, indicating the start of the buffer (the top-of-buffer 
address), but the initial starting address may be any point within the defined 
buffer range. 


Once initialized, BKX/R can be considered to consist of two parts; the most 
significant or higher part (BKH), which corresponds to the all of the most signifi- 
cant 0 bits of BKX/R, and the lower part (BKL), which is the remaining bits, of 
which the most significant bit is a 1 and whose bit position is designated bit 
position N. The N bit position also defines the two parts (ARH and ARL) of the 
address register. The top of buffer address (TBA) is defined by the concatena- 
tion of ARH with N+1 least significant 0 bits. The bottom of buffer address 
(BBA) is defined by the concatenation of ARH and BKL-1, and the current 
address within the buffer is specified by the complete contents of ARX/R. A 
circular buffer of size BKX/R must therefore start on an N-bit boundary (the N 
least significant bits of the address register are 0) where N is smallest integer 
that satisfies 2N > BKX/R, or at the lowest address within the 2K memory 
block. The buffer consists of two halves: the address range for the first half is 
TBA > (BKL/2) — 1 and for the second half BKL/2 > (BKL — 1). Figure 9-26 
illustrates all of the relationships between the defined buffer and the BKX/R 
and ARX/R registers, the bottom of circular buffer address (BBA), and the top 
of circular buffer address (TBA). 
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Figure 9-26. Circular Addressing Registers 
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The minimum block size for an ABU buffer is two; the maximum block size is 
2047, and any buffer of 2047 to 1024 words must start at a relative address 
of 0x0000 with respect to the base address of the 2K block of ABU memory. 
If either of the address registers (AXR or ARR) is loaded with a value specifying 
a location that is outside the range of the currently allocated buffer size as 
defined by BKX/R, improper operation may result. Subsequent memory 
accesses will be performed starting at the location specified, despite the fact 
that they will be to locations which are outside the range of the desired buffer, 
and the ARX/R will be incremented with each access until its contents reach 
the next permitted buffer start address. Any further accesses are then per- 
formed using the correct circular buffering algorithm with the new ARX/R 
contents as the updated buffer start address. It should be noted that any 
accesses performed with improperly loaded ARX/Rs may therefore unexpect- 
edly corrupt some memory locations. 


The following example illustrates some of these functional aspects of the auto- 
buffering process. Consider a transmit buffer of size 5 (BKX = 5) and a receive 
buffer of size 8 (BKR = 8) as shown in Figure 9—27. The transmit buffer may 
start at any relative address that is a multiple of 8 (address 0x0000, 0x0008, 
0x0010, 0x0018, ---, Ox07F8), and the receive buffer may start at any relative 
address that is a multiple of 16 (0x0000, 0x0010, 0x0020, ---, OxO7FO). In this 
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example, the transmit buffer starts at relative address 0x0008 and the receive 
buffer starts at relative address 0x0010. AXR may therefore contain any value 
in the range 0x0008—0x000C and ARR may contain any value in the range 
0x0010-—0x0017. If AXR in this example had been loaded with the value 
0x000D (not acceptable in a modulo 5 buffer), memory accesses would be 
performed and AXR incremented until it reaches address 0x0010 which is an 
acceptable starting address for a modulo 5 buffer. Note, however, that if this 
had occurred, AXR would then specify a transmit buffer starting at the same 
base address as the receive buffer, which may cause improper buffer opera- 
tion. 


Figure 9-27. Transmit Buffer and Receive Buffer Mapping Example 
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The autobuffering process is activated upon request from serial port interface 
when XRDY or RRDY goes high, indicating that a word has been received. The 
required memory access is then performed, following which an interrupt is 
generated if half of the defined buffer (first or second) has been processed. 
The RH and XH flags in BSPCE indicate which half has been processed when 
the interrupt occurs. 
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When autodisabling is selected (HALTX or HALTR bit is set), then when the 
next half (first or second) buffer boundary is encountered, the autobuffering 
enable bit in the BSPCE (BXE or BRE) is cleared so that autobuffering is 
disabled and does not generate any further requests. When transmit autobuf- 
fering is halted, transmission of the current XSR contents and the last value 
loaded in DXR are completed, since these transfers have already been initi- 
ated. Therefore, when using the HALTX function, some delay will normally 
occur between crossing a buffer boundary and transmission actually stopping. 
If it is necessary to identify when transmission has actually ended, software 
should poll for the condition of XRDY = 1 and XSREMPTY = 0, which occurs 
after last bit has been transmitted. 


In the receiver, when using HALTR, since autobuffering is stopped when the 
most recent buffer boundary is crossed, future receptions may be lost, unless 
software begins servicing receive interrupts at this point, since BDRR is no 
longer being read and transferred to memory automatically by the ABU. For 
explanation of how the serial port operates in standard mode when DRR is not 
being read, refer to section 9.2.6, Serial Port Interface Exception Conditions, 
on page 9-26. 


The sequence of events involved in the autobuffering process is summarized 
as follows: 


1) The ABU performs the memory access to the buffer. 


2) The appropriate address register is incremented unless the bottom of buff- 
er has been reached, in which case the address register is modified to 
point to the top of buffer address. 


3) Generate an BXINT or BRINT and update XH/RH if the half buffer or bot- 
tom of buffer boundary has been crossed. 


4) Autodisable the ABU if this function has been selected and if the half buffer 
or bottom of buffer boundary has been crossed. 
9.3.3 System Considerations for BSP Operation 


This section discusses several system-level considerations of BSP operation. 
These considerations include initialization timing issues, software initialization 
of the ABU, and power down mode operation. 
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9.3.3.1 Timing of Serial Port Initialization 
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The ’54x device utilizes a fully static design, and accordingly, in both the serial 
port and the BSP, serial port clocks need not be running between transfers or 
prior to initialization. Therefore, proper operation can still result if FSX/FSR 
occurs simultaneously with CLKX/CLKR starting. Regardless of whether 
serial port clocks have been running previously, however, the timing of serial 
port initialization, and most importantly, when the port is taken out of reset, can 
be critical for proper serial port operation. The most significant consideration 
of this is when the port is taken out of reset with respect to when the first frame 
sync pulse occurs. 


Initialization timing requirements differ on the serial port and the BSP. On the 
serial port, the serial port may be taken out of reset at any time with respect 
to FSX/FSR, however, if XRST/RRST go high during or after the frame sync, 
the frame sync may be ignored. In standard mode operation on the BSP for 
receive, and for transmit with external frame sync (TXM = 0), the BSP must be 
taken out of reset at least two full CLKOUT cycles plus 1/2 serial port clock 
cycle prior to the edge of the clock which detects the active frame sync pulse 
(whether the clock has been running previously or not) for proper operation. 
See Figure 9-28. 


Transmit operations with internal clock and frame sync are not subject to this 
requirement since frame sync is internally generated automatically (after 
XRST is cleared (set to 1)) when BDXR is loaded. 


Note, however, that if external serial port clock is used with internal frame sync, 
frame sync generation may be delayed depending on the timing of clearing 
XRST with respect to the clock. 


Figure 9-28 illustrates the standard mode BSP initialization timing 
requirements for the transmitter. The figure shows standard mode operation 
with external frame (TXM = 0) and clock (MCM = 0), active high frame sync 
(FSP = 0), and data sampled on rising edge (CLKP = 0). In this example, if the 
BFSX pulse occurs during the first two BCLKXs after the transmit section is 
taken out of reset, the transmit frame is ignored and BDxX is placed in the high 
impedance state. 
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Figure 9-28. Standard Mode BSP Initialization Timing 


BCLKX 
e 


rn rn 
o wvwleaos 
. ee 


BFSX / \ 


BDX 


#—_—T?—__ 1 CLKOUT +1/2 Serial Port clock cycle 


XRST / 


In autobuffering mode, for receive, and transmit with external frame sync 
(TXM = 1), the BSP must be taken out of reset at least six CLKOUT cycles plus 
1/2 serial port clock cycle prior to the edge of the clock which detects the active 
frame sync pulse (whether the clock has been running previously or not) for 
proper operation. This is due to the time delay for the ABU logic to be activated. 
See Figure 9-29. 


Transmit operations with internal clock and frame sync are not subject to this 
requirement since frame sync is internally generated automatically after XRST 
is cleared. 


Note, however, that if external serial port clock is used with internal frame sync, 
and if the clock is not running when XRST is cleared, frame sync generation 
may be delayed depending on the timing of clearing XRST with respect to the 
clock. 


Figure 9-29 illustrates autobuffering mode initialization timing requirements 
for the transmitter with external clock and frame sync. The figure shows 
standard mode operation with external frame (TXM = 0) and clock (MCM = 0), 
active high frame sync (FSP = 0), and data sampled on rising edge 
(CLKP = 0). 


Serial Ports 9-51 


Buffered Serial Port (BSP) Interface 


Figure 9-29. Autobuffering Mode Initialization Timing 


9.3.3.2 


9-52 


BCLKX 


o 
’ 


rn 


a 


vw 


co 


fen re. 
vile. 
ee 


BFSX / \ 
BDxX ——————+-{_ XXX XK XK) 


<> 6 CLKOUT +1/2 Serial Port clock cycle 


XRST / 


XRDY \ 


Initialization Examples 


In order to start or restart BSP operation in standard mode, the same steps are 
performed in software as with initializing the serial port (see section 9.2, Serial 
Port Interface, on page 9-4), in addition to which, the BSPCE must be 
initialized to configure any of the enhanced features desired. To start or restart 
the BSP in autobuffering mode, a similar set of steps must also be performed, 
in addition to which, the autobuffering registers must be initialized. 


As an illustration of the proper operation of a buffered serial port, Example 9-3 
and Example 9-4 define a sequence of actions. This illustration is based on 
the use of interrupts to handle the normal I/O between the serial port and CPU. 
The ’545 peripheral configuration has been used as a reference for these 
examples. The examples illustrate initializing the buffered serial port for 
autobuffering mode operation. In both cases, assume that transmit and 
receive interrupts are used to service the ABU interrupts, however, polling of 
the interrupt flag register (IFR) could also be used. Both the transmit and 
receive sections can be initialized at the same time or separately depending 
upon system requirements. 


Example 9-3 initializes the serial port for transmit operations only, with burst 
mode, external frame sync, and external clock selected. The selected data 
format is 16 bits, with frame sync and clock polarities selected to be high true. 
Transmit autobuffering is enabled by setting the BXE bit in the ABUC section 
of BSPCE, and HALTX has been set to 1, which causes transmission to halt 
when half of the defined buffer is transmitted. 


Example 9-4 initializes the serial port for receive operations only, with 
continuous mode selected. Frame sync and clock polarities are selected to be 
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low true, data format is 16 bits, and frame ignore is selected so that two 
received data bytes are packed into a single received word to minimize 
memory requirements. Receive autobuffering is enabled by setting the BRE 
bit in the ABUC section of BSPCE. 


In Example 9-3 and Example 9-4, the transmit and receive interrupts used 
are those that the BSP occupies on the 542, ’543, 545, 546, 548, and 549, 
the devices that include the BSP. However, on other devices that use the BSP, 
different interrupts may be used; and therefore, you should consult the appro- 
priate device documentation. 


Example 9-3. BSP Transmit Initialization Routine 


Action 


Reset and initialize the serial port 
by writing 0008h to BSPC. 


Clear any pending serial port in- 
terrupts by writing 0020h to IFR. 


Enable the serial port interrupts 
by ORing 0020h with IMR. 


Enable interrupts globally (if nec- 
essary) by clearing the INTM bit 
in ST1. 


Initialize the ABU transmit by 
writing 1400h to BSPCE. 


Write the buffer start address to 
AXR. 


Write the buffer size to BKX. 


Start the serial port by writing 
0048h to BSPC. 


Description 


This places both the transmit and receive portions of the serial port 


in reset and sets up the serial port to operate with externally gener- 
ated FSX and CLKX signals and FSX required for transmit/receive of 
each 16-bit word. 


Eliminate any interrupts that may have occurred before initialization. 
Enable transmit interrupts. 


Interrupts must be globally enabled for the CPU to respond. 


This causes the BSP to stop transmitting at the end of the buffer until 
another FSX is received. 


Identify the first buffer address to the ABU. 


Identify the buffer size to the ABU. 


This takes the transmit portion of the serial port out of reset and starts 
operations with the conditions defined in steps 1 and 5. 
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Example 9-4. BSP Receive Initialization Routine 


Action 


Reset and initialize the serial port 
by writing 0000h to BSPC. 


Clear any pending serial port in- 
terrupts by writing 0010h to IFR. 


Enable the serial port interrupts 
by ORing 0010h with IMR. 


Enable interrupts globally (if nec- 
essary) by clearing the INTM bit 
in ST1. 


Initialize the ABU receive by writ- 
ing 2160h to BSPCE. 


Write the buffer start address to 
ARR. 


Write the buffer size to BKR. 


Start the serial port by writing 
0080h to BSPC. 


Description 


This places the receive portion of the serial port in reset and sets up 


the serial port to operate in continuous receive mode with 16-bit 
words. 


Eliminate any interrupts that may have occurred before initialization. 


Enable receive interrupts. 


Interrupts must be globally enabled for the CPU to respond. 


This causes the BSP to receive continuously and not restart if a new 
FSR is received. 


Identify the first buffer address to the ABU. 


Identify the buffer size to the ABU. 


This takes the receive portion of the serial port out of reset and starts 
operations with the conditions defined in steps 1 and 5. 


9.3.4 Buffer Misalignment Interrupt (BMINT) — 549 only 


BMINT is generated when a frame sync occurs and the ABU transmit or re- 
ceive buffer pointer is not at the top of the the buffer address. This is useful for 
detecting several potential error conditions on the serial interface, including 
extraneous and missed clocks and frame sync pulses. A BMINT interrupt, 
therefore, indicates that one or more words may have been lost on the serial 


interface. 


BMINT is useful for detecting buffer misalignment only when the buffer point- 
er(s) are initially loaded with the top of the buffer address and a frame of data 
contains the same number of words as the buffer length. These are the only 
conditions under which a frame sync occurring at a buffer address other than 
the top of the buffer constitute an error condition. In cases where these condi- 
tions are met, a frame sync always occurs when the buffer pointer is at the top 
of the buffer address (if the interface is functioning properly). 


If BMINT is enabled under conditions other than those described, interrupts 
can be generated under circumstances other than actual buffer misalignment. 
In these cases, BMINT should generally be masked in the IMR register so that 
the processor ignores this interrupt. 
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BMINT is available when the device is operating in the auto-buffering mode 
with continuous transfers, the FIG bit cleared to 0, and with external serial 
clocks or frames. 


9.3.5 BSP Operation in Power-Down Mode 


The ’54x offers several power down modes which allow part or all of the device 
to enter a dormant state and dissipate considerably less power than when run- 
ning normally. Power down mode may be invoked in several ways, including 
either executing the IDLE instruction or driving the HOLD input low with the HM 
status bit set to 1. The BSP, like other peripherals (timer, standard serial port), 
can take the CPU out of IDLE using the transmit interrupt (BXINT) or receive 
interrupt (BRINT). 


When in IDLE or HOLD mode, the BSP continues to operate, as is the case 
with the serial port. When in IDLE2/3, unlike the serial port and other on-chip 
peripherals which are stopped with this power-down mode, the BSP can still 
be operated. 


In standard mode, if the BSP is using external clock and frame sync while the 
device is in IDLE2/3, the port will continue to operate, and a transmit interrupt 
(BXINT) or receive interrupt (BRINT) will take the device out of IDLE2/3 mode 
if INTM = 0 before the device executes the IDLE 2 or IDLE 3 instruction. With 
internal clock and/or frame sync, the BSP remains in IDLE2/3 until the CPU 
resumes operation. 


In autobuffering mode, if the BSP is using external clock and frame sync while 
the device is in IDLE2/3, a transmit/receive event will cause the internal BSP 
clock to be turned on for the cycles required to perform the DXR (or DRR) to 
memory transfer. The internal BSP clock is then turned off automatically as 
soon as the transfer is complete so the device will remain in IDLE2/3. The 
device is awakened from IDLE2/3 by the ABU transmit interrupt (BXINT) or 
receive interrupt (BRINT) when the transmit/receive buffer has been halfway 
or entirely emptied or filled if INTM = 0 before the device executes the IDLE 2 
or IDLE 3 instruction. 


Serial Ports 9-55 


Time-Division Multiplexed (TDM) Serial Port Interface 


9.4 Time-Division Multiplexed (TDM) Serial Port Interface 


9.4.1 


The time-division multiplexed (TDM) serial port allows the ’54x device to 
communicate serially with up to seven other devices. The TDM port, therefore, 
provides a simple and efficient interface for multiprocessing applications. 


The TDM serial port is a superset of the serial port described in section 9.2 on 
page 9-4. By means of the TDM bit in the TDM serial port control register 
(TSPC), the port can be configured in multiprocessing mode (TDM = 1) or 
stand-alone mode (TDM = 0). When in stand-alone mode, the port operates 
as described in section 9.2. When in multiprocessing mode, the port operates 
as described in this section. The port can be shut down for low power 
consumption via the XRST and RRST bits, as described in section 9.2. 


Basic Time-Division Multiplexed Operation 


Time-division multiplexing is the division of time intervals into a number of sub- 
intervals, with each subinterval representing a communications channel 
according to a prespecified arrangement. Figure 9-30 shows a 4-channel 
TDM scheme. Note that the first time slot is labeled chan 1 (channel 1), the next 
chan 2 (channel 2), etc. Channel 1 is active during the first communications 
period and during every fourth period thereafter. The remaining three channels 
are interleaved in time with channel 1. 


Figure 9-30. Time-Division Multiplexing 
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The ’54x TDM port uses eight TDM channels. Which device is to transmit and 
which device or devices is/are to receive for each channel may be indepen- 
dently specified. This results in a high degree of flexibility in interprocessor 
communications. 


9.4.2 TDM Serial Port Interface Registers 
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The TDM serial port operates through six memory-mapped registers and two 
other register (TRSR and TXSR) that are not directly accessible to the pro- 
gram, but are used in the implementation of the double-buffering capability. 
These eight registers are listed in Table 9-13. 
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Table 9-13. TDM Serial Port Registers 


Address Register Description 
t TRCV TDM data receive register 
t TDXR TDM data transmit register 
T lope TDM serial port control register 
tT TCSR TDM channel select register 
7 TRTIA TDM receive/transmit address register 
t TRAD TDM receive address register 
— TRSR TDM data receive shift register 
— TXSR TDM data transmit shift register 


t See section 8.2, Peripheral Memory-Mapped Registers. 


L) 


TDM data receive register (TRCV). The 16-bit TDM data receive register 
(TRCV) holds the incoming TDM serial data. The TRCV has the same 
function as the DRR, described on page 9-5. 


TDM data transmit register (TDXR). The 16-bit TDM data transmit register 
(TDXR) holds the outgoing TDM serial data. The TDXR has the same 
function as the DXR, described on page 9-5. 


TDM serial port control register (TSPC). The 16-bit TDM serial port control 
register (TSPC) contains the mode control and status bits of the TDM 
serial port interface. The TSPC is identical to the SPC (Figure 9-3) except 
that bit 0 serves as the TDM mode enable control bit in the TSPC. The 
TDM bit configures the port in TDM mode (TDM = 1) or stand-alone mode 
(TDM = 0). In stand-alone mode, the port operates as a standard serial 
port as described on page 9-4. 


TDM channel select register (TCSR). The 16-bit TDM channel select 
register (TCSR) specifies in which time slot(s) each 54x device is to 
transmit. 


TDM receive/transmit address register (TRTA). The 16-bit TDM receive/ 
transmit address register (TRTA) specifies in the eight LSBs (RAO—RA7) 
the receive address of the ’54x device and in the eight MSBs (TAO-TA7) 
the transmit address of the 54x device. 


TDM receive address register (TRAD). The 16-bit TDM receive address 
register (TRAD) contains various information regarding the status of the 
TDM address line (TADD). 
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Lj TDM data receive shift register (TRSR). The 16-bit TDM data receive shift 
register (TRSR) controls the storing of the data, from the input pin, to the 
TRCV. The TRSR has the same function as the RSR, described on page 
9-5. 


Lj TDM data transmit shift register (TXSR). The 16-bit TDM data transmit 
shift register (TXSR) controls the transfer of the outgoing data from the 
TDXR and holds the data to be transmitted on the data-transmit (TDX) pin. 
The TXSR has the same function as the XSR, described on page 9-5. 


9.4.3. TDM Serial Port Interface Operation 


Figure 9-31(a) shows the ’54x TDM port architecture. Up to eight devices can 
be placed on the four-wire serial bus. This four-wire bus consists of a conven- 
tional serial port’s bus of clock, frame, and data (TCLK, TFRM, and TDAT) 
wires plus an additional wire (TADD) that carries the device addressing 
information. Note that the TDAT and TADD signals are bidirectional signals 
and are often driven by different devices on the bus during different time slots 
within a given frame of operation. 


Figure 9-31. TDM 4-Wire Bus 
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The TADD line, which is driven by a particular device for a particular time slot, 
determines which device(s) in the TDM configuration should execute a valid 
TDM receive during that time slot. This is similar to a valid serial port read 
operation, as described in section 9.2, Serial Port Interface, on page 9-4 
except that some corresponding TDM registers are named differently. The 
TDM receive register is TRCV, and the TDM receive shift register is TRSR. 
Data is transmitted on the bidirectional TDAT line. 


Note that in Figure 9—31(b) the device TDX and TDR pins are tied together 
externally to form the TDAT line. Also, note that only one device can drive the 
data and address line (TDAT and TADD) in a particular slot. All other devices’ 
TDAT and TADD outputs should be in the high-impedance state during that 
slot, which is accomplished through proper programming of the TDM port 
control registers (this is described in detail later in this section). Meanwhile, in 
that particular slot, all the devices (including the one driving that slot) sample 
the TDAT and TADD lines to determine if the current transmission represents 
valid data to be read by any one of the devices on the bus (this is also 
discussed in detail later in this section). When a device recognizes an address 
to which it is supposed to respond, a valid TDM read then occurs, the value 
is transferred from TRSR to TRCV. A receive interrupt (TRINT) is generated, 
which indicates that TRCV has valid receive data and can be read. 


All TDM port operations are synchronized by the TCLK and TFRM signals. 
Each of them are generated by only one device (typically the same device), 
referred to as the TCLK and TFRM source(s). The word master is not used 
here because it implies that one device controls the other, which is not the 
case, and TCSR must be set to prevent slot contention. Consequently, the 
remaining devices in the TDM configuration use these signals as inputs. 
Figure 9-31 (b) shows that TCLKX and TCLKR are externally tied together to 
form the TCLK line. Also, TFRM and TADD originate from the TFSX and TFSR 
pins respectively. This is done to make the TDM serial port also easy to use 
in standard mode. 


TDM port operation is controlled by six memory-mapped registers. The layout 
of these registers is shown in Figure 9-32. The TRCV and TDXR registers 
have the same functions as the DRR and DXR registers respectively, 
described in section 9.2, Serial Port Interface. The TSPC is identical to the 
SPC except that bit 0 serves as the TDM mode enable control bitin the TSPC. 
This bit configures the port in TDM mode (TDM = 1) or stand-alone mode 
(TDM = 0). In stand-alone mode, the port operates as a standard serial port 
as described in section 9.2. Refer to section 9.4.6, Examples of TDM Serial 
Port Interface Operation, on page 9-64 for additional information about the 
function of the bits in these registers. 
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Figure 9-32. TDM Serial Port Registers Diagram 
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When TDM mode is selected, the DLB and FO bits in the TSPC are 
hard-configured to 0, resulting in no access to the digital loopback mode and 
in a fixed word length of 16 bits (a different type of loopback is discussed in the 
example in section 9.4.6 on page 9-64). Also, the value of FSM does not affect 
the port when TDM = 1, and the states of the underflow and overrun flags are 
indeterminate (section 9.4.5, TDM Serial Port Interface Exception Conditions, 
on page 9-64 explains how exceptions are handled in TDM mode). If TDM = 1, 
changes made to the contents of the TSPC become effective upon completion 
of channel 7 of the current frame. Thus the TSPC value cannot be changed 
for the current frame; any changes made will take effect in the next frame. 


The source device for the TCLK and TFRM timing signals is set by the MCM 
and TXM bits, respectively. The TCLK source device is identified by setting the 
MCM bit of its TSPC to 1. Typically, this device is the same one that supplies 
the TDM port clock signal TCLK. The TCLKX pin is configured as an input if 
MCM = 0 and an output if MCM = 1. In the latter case (internal 54x clock), the 
device whose MCM = 1 supplies the clock (TCLK frequency = one fourth of 
CLKOUT frequency) for all devices on the TDM bus. The clock can be supplied 
by an external source if MCM = 0 for all devices. TFRM can also be supplied 
externally if TXM = 0. An external TFRM, however, must meet TDM receive 
timing specifications with respect to TCLK for proper operation. No more than 
one device should have MCM or TXM set to 1 at any given time. The 
specification of which device is to supply clock and framing signals is typically 
made only once, during system initialization. 


The TDM channel select register (TCSR) of a given device specifies in which 
time slot(s) that device is to transmit. A 1 in any one or more of bits 0—7 of the 
TCSR sets the transmitter active during the corresponding time slot. Again, a 
key system-level constraint is that no more than one device can transmit 
during the same time slot; devices do not check for bus contention, and slots 
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must be consistently assigned. As in TSPC operation, a write to TCSR during 
a particular frame is valid only during the next frame. However, a given device 
can transmit in more than one slot. This is discussed in more detail in section 
9.4.4, TDM Mode Transmit and Receive Operations, on page 9-62 with an 
emphasis on the utilization of TRTA, TDXR, and TCSR in this respect. 


The TDM receive/transmit address register (TRTA) of a given device specifies 
two key pieces of information. The lower half specifies the receive address of 
the device, while the upper half of TRTA specifies the transmit address. The 
receive address (RA7—RAO, refer to Figure 9-32) is the 8-bit value that a 
device compares to the 8-bit value it samples on the TADD line in a particular 
slot to determine whether it should execute a valid TDM receive. The receive 
address, therefore, establishes the slots in which that device may receive, 
dependent on the addresses present in those slots, as specified by the 
transmitting devices. This process occurs on each device during every slot. 


The transmit address (TA7—TAO, refer to Figure 9-32) is the address that the 
device drives onthe TADD line during atransmit operation on an assigned slot. 
The transmit address establishes which receiving devices may execute a valid 
TDM receive on the driven data. 


Only one device at a time can drive a transmit address on TADD. Each 
processor bit-wise-logically-ANDs the value it samples on the TADD line with 
its receive address (RA7—RAO). If this operation results in a nonzero value, 
then a valid TDM receive is executed on the processor(s) whose receive 
addresses match the transmitted address. Thus, for one device to transmit to 
another, there must be at least one bit in the upper half of the transmitting 
device’s TRTA (the transmit address) with a value of 1 that matches one bit 
with a value of 1 inthe lower half of TRTA (the receive address) of the receiving 
device. This method of configuration of TRTA allows one device to transmit to 
one or more devices, and for any one device to receive from one or more than 
one transmitter. This can also allow the transmitting device to control which 
devices receive, without the receive address on any of the devices having to 
be changed. 


The TDM receive address register (TRAD) contains various information 
regarding the status of the TADD line which can be polled to verify the previous 
values of this signal and to verify the relationship between instruction cycles 
and TDM port timing. 


Bits 138-11 (X2—X0) contain the current slot number value, regardless of 


whether a valid data receive was executed in that slot or not. This value is 
latched at the beginning of the slot and retained only until the end of the slot. 


Serial Ports 9-61 


Time-Division Multiplexed (TDM) Serial Port Interface 


Bits 10-8 (SO2—SO) hold the number of the last slot plus one (modulo eight) 
in which data was received (that is, if the last valid data read occurred in slot 5 
in the previous frame, these bits would contain the number six). This value is 
latched during the TDM receive interrupt (TRINT) at the end of the slot in which 
the last valid data receive occurred, and maintained until the end of the next 
slot in which a valid receive occurs. 


Bits 7-0 (A7—A0O) hold the last address sampled on the TADD line, regardless 
of whether a valid data receive was executed or not. This value is latched half- 
way through each slot (so the value on the TADD may be shifted in) and main- 
tained until halfway through the next slot, whether a valid receive is executed 
or not. 


9.4.4 TDM Mode Transmit and Receive Operations 


Figure 9-33 shows the timing for TDM port transfers. The TCLK and TFRM 
signals are generated by the timing source device. The TCLK frequency is one 
fourth the frequency of CLKOUT if generated by a 54x device. The TFRM 
pulse occurs every 128 TCLK cycles and is timed to coincide with bit 0 of slot 7, 
which is the last bit of the previous frame. The relationship of TFRM and TCLK 
allows 16 data bits for each of eight time slots to be driven on the TDAT line, 
which also permits the processor to execute a maximum of 64 instructions dur- 
ing each slot, assuming that a’54x internal clock is used. Beginning with slot 0 
and with the MSB first, the transmitter drives 16 data bits for each slot, with 
each bit having a duration of one TCLK cycle, with the exception of the first bit 
of each slot, which lasts only half of one bit time. Note that data is both clocked 
onto the TDAT line by the transmitting device and sampled from the TDAT line 
by receiving devices on the rising edge of TCLK (see the data sheet for 
detailed TDM interface timings). 


Figure 9-33. Serial Port Timing (TDM Mode) 
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Simultaneous with data transfer, the transmitting device also drives the TADD 
line with the transmit address for each slot. This information, unlike that on 
TDAT, is only one byte long and is transmitted with the LSB first for the first half 
of the slot. During the second half of the slot (that is, the last eight TCLK peri- 
ods) the TADD line is driven high. The TDM receive logic samples the TADD 
line only for the first eight TCLK periods, ignoring it during the second half of 
the slot. Therefore, the transmitting device (if not a’54x) could drive TADD high 
or low during that time period. Note that, like TDAT, the first TADD bit trans- 
mitted lasts for only one half of one TCLK cycle. 


If no device on the TDM bus is configured to transmit in a slot (that is, none of 
the devices has a 1 for the corresponding slot in their TCSR), that slot is con- 
sidered empty. In an empty slot, both TADD and TDAT are high impedance. 
This condition has the potential for sourious receives, however, because TDAT 
and TADD are always sampled, and a device performs a valid TDM reception 
if its receive address matches the address on the TADD line. To avoid spurious 
reads, a 1-kilohm pull-down resistor mustbe tied to the TADD line. This causes 
the TADD line to read low on empty slots. Otherwise, any noise on the TADD 
line that happens to match a particular receive address would result in a spuri- 
ous read. If power dissipation is a concern and the resistor is not desired, then 
an arbitrary processor with transmit address equal to Oh can drive empty slots 
by writing to TDXR in those slots. Slot manipulation is explained later in this 
section. The 1-kilohm resistor is not required on the TDAT line. 


An empty TDM slot can result in the following cases: the first obvious case, as 
mentioned above, occurs when no device has its TCSR configured to transmit 
in that slot. A second more subtle case occurs when TDXR has not been 
loaded before atransmit slot in a particular frame. This may also happen when 
the TCSR contents are changed, since the actual TCSR contents are not 
updated until the next TFRM pulse occurs. Therefore, any subsequent change 
takes effect only in the next frame. The same is true for the receive address 
(the lower half of TRTA). The transmit address (upper half of TRTA), however, 
and TDXR, clearly, may be changed within the current frame for a particular 
slot, assuming that the slot has not yet been reached when the instruction to 
load the TRTA or TDXR is executed. Note that it is not necessary to load the 
transmit address each time TDXR is loaded; when a TDXR load occurs and 
a transmission begins, the current transmit address in TRTA is transmitted on 
TADD. 


The current slot number may be obtained by reading the X2—X0 bits in TRAD. 
This affords the flexibility of reconfiguring the TDM port on a slot-by-slot basis, 
and even slot sharing if desired. The key to utilizing this capability is to under- 
stand the timing relationship between the instructions being executed and the 
frame/slots of the TDM port. If the TDM port is to be manipulated on a slot-by- 
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slot basis, changes must be made to appropriate registers quickly enough for 
the desired effect to take place at the desired time. It is also important to take 
into account that the TCSR and the receive address (lower half of TRTA) take 
effect only at the start of a new frame, while the transmit address (upper half 
of TRTA) and TDXR (transmit data) can take effect at the start of a new slot, 
as mentioned previously. 


Note that if the transmit address is being changed on the fly, care should be 
exercised not to corrupt the receive address, since both addresses are located 
in the TRTA, thus maintaining the convention of allowing the transmitting 
device to specify which devices can receive. 


9.4.5 TDM Serial Port Interface Exception Conditions 


Because of the nature of the TDM architecture, with the ability for one proces- 
sor to transmitin multiple slots, the concepts of overrun and underflow become 
indeterminate. Therefore, the overrun and underflow flags are not active in 
TDM mode. 


In the receiver, if TRCV has not been read and a valid receive operation is initi- 
ated (because of the value on TADD and the device’s receive address), the 
present value of TRCV is overwritten; the receiver is not halted. On the other 
hand, if TDXR has not been updated before a transmission, the TADD or TDAT 
lines are not driven, and these pins remain in the high-impedance state. This 
mode of operation prevents spurious transmits from occurring. 


If a TFRM pulse occurs at an improper time during a frame, the TDM port is 
not able to continue functioning properly, since slot and bit numbers become 
ambiguous when this occurs. Therefore, only one TFRM should occur every 
128 TCLK cycles. Unlike the serial port, the TDM port cannot be reinitialized 
with a frame sync pulse during transmission. To correct an improperly timed 
TFRM pulse, the TDM port must be reset. 


9.4.6 Examples of TDM Serial Port Interface Operation 
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The following is anexample of TDM serial port operation, showing the contents 
of some of the key device registers involved, and explaining the effect of this 
configuration on port operation. In this example, eight devices are connected 
to the TDM serial port as shown in Figure 9-34. 


Table 9-14 shows the TADD value during each of the eight channels given the 
transmitter and receiver designations shown. This example shows the config- 
uration for eight devices to communicate with each other. In this example, 
device 0 broadcasts to all other device addresses during slot 0. In subsequent 
time slots, devices 1-7 each communicate to one other processor. 
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Figure 9-34. TDM Example Configuration Diagram 


Device 0 Device 1 Device 2 Device 3 
Channel 0 (TCSR=01h) Channel 7 (TCSR=80h) Channel 6 (TCSR=40h) Channel 5 (TCSR=20h) 
TA=FEh, RA=0th TA=01h, RA=04h TA=02h, RA=04h TA=04h, RA=08h 


Device 4 Device 5 Device 6 Device 7 
Channel 4 (TCSR=10h) Channel 3 (TCSR= 08h) Channel 2 (TCSR= 04h) Channel 1 (TCSR= 02h) 
TA=08h, RA=10h TA=10h, RA=20h TA=20h, RA=40h TA=40h, RA=80h 


Table 9-14. Interprocessor Communications Scenario 


Transmitter Receiver 
Channel TADD Data Device Device(s) 
0 FEh 0 1-7 
1 40h 7 6 
2 20h 6 5 
3 10h 5 4 
4 08h 4 3 
5 04h 3 2 
6 02h 2 1 
7 Oth 1 0 


Table 9-15 shows the TDM serial port register contents of each device that 
results in the scenario given in Table 9-14. Device 0 provides the clock and 
frame control signals for all channels and devices. The TCSR and TRTA con- 
tents specify which device is to transmit on a given channel and which devices 
are to receive. 
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Table 9-15. TDM Register Contents 
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Device TSPC TRTA TCSR 
0 xxF9h FEOth xx01h 
1 xxC9h 0102h xx80h 
2 xxC9h 0204h xx40h 
3 xxC9h 0408h xx20h 
4 xxC9h 0810h xx10h 
5 xxC9h 1020h xx08h 
6 xxC9h 2040h xx04h 
7 xxC9h 4080h xx02h 


In this example, the transmit address of a given device (the upper byte of 
TRTA) matches the receive address (the lower byte of TRTA) of the receiving 
device. Note, however, that it is not necessary for the transmit and receive 
addresses to match exactly; the matching operation implemented in the 
receiver is a bitwise AND operation. Thus, it is only necessary that one bit in 
the field matches for a receive to occur. The advantage of this scheme is that 
atransmitting device can select the device or devices to receive its transmitted 
data by simply changing its transmit address (as long as each devices’ receive 
address is unique, the receive address of the receiving device does not need 
to be changed). In the example, device 0 can transmit to any combination of 
the other devices by merely writing to the upper byte of TRTA. Therefore, if a 
transmitting device changed its TRTA to 8001h on the fly, it would transmit only 
to device 7. 


A device may also transmit to itself, because both the transmit and receive 
operations are executed on the rising edge of TCLK (see the 54x data sheet 
for TDM interface timings). To enable this type of loopback, it is necessary to 
use the standard TDM port interface connections as shown in Figure 9-31. 
Then, if device 0 has a TRTA of 0101h, it would transmit only to itself. 


As an illustration of the proper operation of a TDM serial port, Example 9-5 
through Example 9-8 define a sequence of actions. This illustration is based 
on the use of interrupts to handle the normal I/O between the serial port and 
CPU. The ’542 peripheral configuration has been used as a reference for 
these examples. 


In Example 9-5 the procedure for a one-way transmit of a sequence of values 
from device 0 to device 1 is shown. Device 0 transmits in slot 0 and has a 
transmit address of 01h. Example 9—7 shows the procedure for device 1. It 
has a receive address of 01h. 
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Example 9-5. TDM Serial Port Transmit Initialization Routine 


Action 


Reset and initialize the TDM seri- 
al port by writing 0039h to TSPC. 


Clear any pending TDM serial 
port transmit interrupts by writing 
0080h to IFR. 


Enable the TDM serial port inter- 
rupts by ORing 0080h with IMR. 


Enable interrupts globally (if nec- 
essary) by clearing the INTM bit 
in ST1. 


Write 0001h to TCSR. 
Write 0100h to TRTA. 


Start the serial port by writing 
0049h to TSPC. 


Perform a handshake to verify 
that the receiving device is ready 
to receive data. 


Write the first data value to TDXR 
(if not already done in step 8). 


Description 


This places both the transmit and receive portions of the TDM serial 


port in reset and sets up the serial port to operate with internally 
generated TFRM and TCLK signals in TDM mode. 


Eliminate any interrupts that may have occurred before initialization. 


Enable transmit interrupts. 


Interrupts must be globally enabled for the CPU to respond. 


This selects time slot 0 as the transmission time slot for this device. 


This sets up this device to transmit data to the device receiving at 
address 01h. It also sets up this device to ignore all received data. 


This takes the transmit portion of the serial port out of reset and starts 
operations with the conditions defined in steps 1, 5 and 6. 


For asingle device pair, this could make use of BIO and XF. For sever- 
al devices this might mean that the device generating TFRM and 
TCLK broadcasts a command to all other devices until each one 
returns an acknowledge. 


This initiates serial port transmit operations since TADD and TDAT 
are not driven if new data is not written to TDXR. 


Example 9-6. TDM Serial Port Transmit Interrupt Service Routine 


Action 


Save any context that may be 
modified on the stack. 


Write TDXR with a new value 
from a predetermined location in 
memory. 


Restore the context that was 
saved in step 1. 


Return from the ISR with an 
RETE to reenable interrupts. 


Description 


The operating context of the interrupted code must be maintained. 


Write the new transmit data for the ISR. 


The operating context of the interrupted code must be maintained. 


Interrupts must be reenabled for the CPU to respond to the next 
interrupt. 
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Example 9-7. TDM Serial Port Receive Initialization Routine 


Action 


Reset and initialize the TDM seri- 
al port by writing 0009h to TSPC. 


Clear any pending TDM serial 
port receive interrupts by writing 
0040h to IFR. 


Enable the TDM serial port inter- 
rupts by ORing 0040h with IMR. 


Enable interrupts globally (if nec- 
essary) by clearing the INTM bit 
in ST1. 


Write 0000h to TCSR. 
Write 0001h to TRTA. 


Perform a handshake to notify 
the transmitting device that it is 
okay to send data. 


Description 


This places both the transmit and receive portions of the TDM serial 


port in reset and sets up the serial port to operate with externally 
generated TFRM and TCLK signals in TDM mode. 


Eliminate any interrupts that may have occurred before initialization. 


Enable receive interrupts. 


Interrupts must be globally enabled for the CPU to respond. 


This sets up this device to not transmit in any time slot. 


This sets up this device to not address any device. It also sets up this 
device to receive data sent to address 01h. 


Forasingle device pair, this could make use of BIO and XF. For sever- 
al devices, this might mean that the device waits for a broadcast com- 
mand and then returns an acknowledge. 


Example 9-8. TDM Serial Port Receive Interrupt Service Routine 


9-68 


Action 


Save any context that may be 
modified on the stack. 


Read TDRR and write the value 
to a predetermined location in 
memory. 


Restore the context that was 
saved in step 1. 


Return from the ISR with an 
RETE to reenable interrupts. 


Description 


The operating context of the interrupted code must be maintained. 


Read the new received data for the ISR. 


The operating context of the interrupted code must be maintained. 


Interrupts must be reenabled for the CPU to respond to the next 
interrupt. 


Chapter 10 


External Bus Operation 


This chapter describes the external bus operation and control for memory and 
I/O accesses. Some of the external bus operation and control features of the 
’54x include software wait states, bank-switching logic, and hold logic. The 
54x supports a wide range of system interfacing requirements. 


The 5410 enhanced external parallel interface (XIO) is not described in this 
chapter. See the 5410 datasheet for details about the external memory inter- 


face. 
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10.1 External Bus Interface 


Table 10-1. 


Signal Name 
AO-A15 
DO-D15 
MSTRB 

PS 

DS 

IOSTRB 


The ’54x external interface consists of data buses, address buses, and a set 
of control signals for accessing off-chip memory and I/O ports. Table 10-1 lists 
key signals for the external interface. 


Key External Interface Signals 


542, ao 545, °548, 549 5402 5420 

B46 6410 Description 

15-0 22-0 19-0 17-0 Address bus 

15-0 15-0 15-0 15-0 Data bus 
al al a pies External memory access strobe 
al al al Val Program space select 
ir val i a Data space select 
al al i in I/O access strobe 
al al i al I/O space select 
al al in a Read/write signal 
al al al Val Data ready to complete cycle 
yn \- al Hold request 
al al val Hold acknowledge 
i“ al a Micro state complete 
al al in Instruction acquisition 
al al in Interrupt acknowledge 


The parallel interface consists of two mutually-exclusive interfaces controlled 
by the MSTRB and IOSTRB signals. MSTRB is activated for memory 
accesses (program or data), and IOSTRB is used to access I/O ports. The R/W 
signal controls the direction of the accesses. 


The external ready input signal (READY) and the software-generated wait 
states allow the processor to interface with memory and I/O devices of varying 
speeds. When communicating with slower devices, the CPU waits until the 
other device completes its function and sends the READY signal to continue 
execution. 
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In some cases, wait states are needed only when transitions are made 
between two external memory devices. The programmable bank-switching 
logic provides automatic insertion of a wait state in these situations. 


The hold mode allows an external device to take control of the 54x external 
buses to access the resources in the 54x external program, data, and I/O 
memory spaces. Two hold mode types, normal mode and concurrent DMA 
mode are available. 


When the CPU addresses internal memory, the data bus is placed in the high- 
impedance state. However, the address bus and the memory-select signals 
(program select (PS), data select (DS), and I/O select (IS)) maintain the 
previous state. The MSTRB, IOSTRB, R/W, IAQ, and MSC signals remain 
inactive. If the address visibility mode (AVIS) bit, located in the PMST, is set 
to 1, the internal program address is placed on the address bus with an active 
IAQ. 


When the CPU addresses external data or I/O space, the extended address 
lines are driven to logic 0. This is also the case when the CPU addresses inter- 
nal memory with the AVIS (address visibility) set to 1. 
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10.2 External Bus Priority 


The 54x CPU has one program bus (PB), three data buses (CB, DB, and EB), 
and four address buses (PAB, CAB, DAB, and EAB). The CPU can access its 
buses simultaneously because of its pipelined structure; however, the external 
interface can support only one access per cycle. A pipeline conflict occurs if, 
inasingle cycle, the CPU accesses external memory twice to fetch an instruc- 
tion, a data-memory operand, or an external I/O device. This pipeline conflict 
is automatically resolved by a predetermined priority, defined by the stage of 
the pipeline. 


Figure 10—1 shows multiple CPU accesses to fetch an instruction, and to write 
and read data operands over the external interface in one cycle. Data 
accesses have a higher priority than program-memory fetches: the program- 
memory fetch cannot begin until all CPU data accesses are completed. 


Figure 10-1. External Bus Interface Priority 
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Pipeline conflicts occur when the program and data are in external memory 
and a single-operand write instruction is followed by a dual-operand read or 
a 32-bit operand read. The following sequence of instructions shows the pipe- 
line conflict discussed. 

ST T, *AR6 ;Smem write operation 


LD *AR4+,A ;Xmem and Ymem read operation 
| |MAC *AR5+, B 


Chapter 7, Pipeline, describes pipeline operation and conflicts in detail. 
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10.3 External Bus Control 


Two units in the ’54x control the external bus: the wait-state generator and the 
bank-switching logic. These units are controlled by two registers, the software 
wait-state register (SWWSR) and the bank-switching control register (BSCR). 


10.3.1 Wait-State Generator 


The software-programmable wait-state generator can extend external bus 
cycles by up to seven machine cycles (14 machine cycles on ’549, 5402, 5410 
and ’5420 devices), providing a convenient means to interface the 54x to slow- 
er external devices. Devices that require more than seven wait states can be 
interfaced using the hardware READY line. When all external accesses are 
configured for zero wait states, the internal clocks to the wait-state generator 
are shut off. Shutting off these paths from the internal clocks allows the device 
to run with lower power consumption. 


The software-programmable wait-state generator is controlled by the 16-bit 
software wait-state register (SWWSR), which is memory-mapped to address 
0028h in data space. 


The program and data spaces each consist of two 32K-word blocks; the I/O 
space consists of one 64K-word block. Each of these blocks has a correspond- 
ing 3-bit field in the SWWSR. These fields are shown in Figure 10-2 and 
described in Table 10-2. The SWWSR bit fields of the 548, ’549, ’5402, 5410, 
and ’5420 are described in Table 10-3. 


The value of a 3-bit field in SWWSR specifies the number of wait states to be 
inserted for each access in the corresponding space and address range. The 
minimum value, which adds no wait states, is 0 (000b). A value of 7 (111b) 
provides the maximum number of wait states. 


Figure 10-2. Software Wait-State Register (SWWSR) Diagram 


15 14-12 11-9 8-6 5-3 2-0 
R R/W R/W R/W R/W R/W 


Tt XPA bit on 548, 549, 5402, 5410, and 5420 only 
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Table 10-2. Software Wait-State Register (SWWSR) Bit Summary 


Bit 
15 


14-12 


11-9 


2-0 


Reset 

Name Value Function 

Reserved 0 Reserved. In the 548, and ’549, this bit changes the operation of the program 
fields (see Table 10-3). 

/O 1 I/O space. The field value (0—7) corresponds to the number of wait states for I/O 
space 0000—FFFFh. 

Data 1 Data space. The field value (O—7) corresponds to the number of wait states 
for data space 8000—FFFFh. 

Data 1 Data space. The field value (0O—7) corresponds to the number of wait states 
for data space 0000—7FFFh. 

Program 1 Program space. The field value (0—7) corresponds to the number of wait 
states for program space 8000—FFFFh. 

Program 1 Program space. The field value (0O—7) corresponds to the number of wait 


states for program space 0000—7FFFh. 


Figure 10-3. Software Wait-State Control Register (SWCR) Diagram 


15 1 0 


xs Suh 


When SWWSNM is set to 1, the wait states are multiplied by two extending the 
maximum number of wait states from 7 to 14. 


The '549, 5402, ’5410, and 5420 have an extra bit (software wait-state multi- 
plier, SWSM) that resides in SWCR, which is memory mapped to address 
002Bh in data space. 
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Table 10-3. °548/549/'5402/54 10/5420 Software Wait-State Register 


Bit 
15 


5-3 


(SWWSR) Bit Summary 
Reset 

Name Value Function 

XPA 0 Extended program address control. Selects the address ranges selected by the 
program fields. 

0 1 I/O space. The field value (0O—7) corresponds to the number of wait states for I/O 
space 0000—-FFFFh. 

Data 1 Data space. The field value (O—7) corresponds to the number of wait states 
for data space 8000—FFFFh. 

Data 1 Data space. The field value (O—7) corresponds to the number of wait states 
for data space 0000—7FFFh. 

Program 1 Program space. The field value (0O—7) corresponds to the number of wait 
states for: 
[J] XPA = 0: xx8000 — xxFFFFh 
[J XPA = 1: 400000h—7FFFFF 

Program 1 Program space. The field value (O—7) corresponds to the number of wait 
states for: 


(J XPA = 0: xx0000—xx7FFFh 
(J XPA = 1: 000000-3FFFFFh 


Figure 10-4 is a block diagram of the wait-state generator logic for external 
program space. When an external program access is decoded, the appropri- 
ate field of the SWWSR is loaded into the counter. If the field is not 000, a 
not-ready signal is sent to the CPU and the wait-state counter is started. The 
not-ready condition is maintained until the counter decrements to 0 and the 
external READY line is set high. The external READY and the wait-state 
READY are ORed together to generate the CPU WAIT signal. The READY line 
is machine-sampled at the falling edge of CLKOUT. The processor detects 
READY only if a minimum of two software wait states are programmed. The 
external READY line is not sampled until the last wait-state cycle. 
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Figure 10-4. Software Wait-State Generator Block Diagram 


decoder 


External 
logic 
3-bit 
counter 
READY 


At reset, all fields in the SWWSR are set to 1116 (SWWSR = 7FFFh), the 
maximum number of wait states for external accesses. This feature ensures 
that the CPU can communicate with slow external memories during processor 
initialization. 


Table 10-4. Number of CLKOUT1 Cycles Per Access for Various Numbers of Wait States 


Number of CLKOUT1 Cyclest 


~ Hardware Wait State -—«-Software Wait State _ 
Number of Wait States “Read -—Write “Read - Write 
0 NA NA 1 2n 
1 NA NA 2 3n 
2 3 4n 3 4n 
s 4 5n 4 sn 


t Where n is the number of consecutive write cycles. 


10.3.2 Bank-Switching Logic 


Programmable bank-switching logic allows the 54x to switch between exter- 
nal memory banks without requiring external wait states for memories that 
need several cycles to turn off. The bank-switching logic automatically inserts 
one cycle when accesses cross memory-bank boundaries inside program or 
data space. 


Bank switching is defined by the bank-switching control register (BSCR), 
which is memory-mapped at address 0029h. Figure 10-5 shows the BSCR 
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and its fields are described in Table 10—5. For more information about bank- 
switching logic, see the device-specific datasheet. 


Figure 10-5. Bank-Switching Control Register (BSCR) Diagram 


15-12 11 10-9 8 7-3 2 1 0 
R/W R/W R/W R/W R/W R/W 
Table 10-5. Bank-Switching Control Register (BSCR) Bit Summary 
Reset 
Bit Name Value Function 
15-12 BNKCMP = Bank compare. Determines the external memory-bank size. BNKCMP is used to 
mask the four MSBs of an address. For example, if BNKCMP = 1111b, the four 
MSBs (bits 12-15) are compared, resulting in a bank size of 4K words. Bank sizes 
from 4K words to 64K words are allowed. Table 10-6 on page 10-11 shows the 
relationship between BNKCMP and the address range. 
11 PS-DS - Program read—data read access. Inserts an extra cycle between consecutive 
accesses of program read and data read, or data read and program read. 
PS-DS =0 No extra cycles are inserted by this feature except when banks 
are crossed. 
PS-DS = 1 One extra cycle is inserted between consecutive accesses of 
program read and data read, or data read and program read. 
10-9 Reserved - These bits are reserved. 
8 IPIRQ - Interprocessor interrupt request bit. 
7-3 Reserved - These bits are reserved. 
2 HBH - HPI bus holder bit. 
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Table 10-5. Bank-Switching Control Register (BSCR) Bit Summary (Continued) 


Bit 
1 


Name 
BH 


Reset 
Value 


0 


Function 


Bus holder. Controls the bus holder: 


0 


EXIO 


BH =0 The bus holder is disabled. 


BH = 1 The bus holder is enabled. The data bus, D(15-0), is held in the 
previous logic level. 


External bus interface off. The EXIO bit controls the external-bus-off function. 
EXIO =0 The external-bus-off function is disabled. 


EXIO = 1 The external-bus-off function is enabled. The address bus, data 
bus, and control signals become inactive after completing the 
current bus cycle. Table 10—7 on page 10-11 lists the state of the 
signals when the external bus interface is disabled. The DROM, 
MP/MC, and OVLY bits in PMST and the HM bit in ST1 cannot 


be modified. 
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Table 10-6 summarizes the relationship between BNKCMP, the address bits 
to be compared, and the bank size. BNKCMP values not listed in the table are 
not allowed. Table 10-7 lists the state of the ports when the external bus inter- 
face is disabled (EXIO = 1). 


Table 10-6. Relationship Between BNKCMP and Bank Size 


0 
1 
1 


BNKCMP 
Bit15 Bit14 Bit13 Bit12 


0 
0 
1 


oo CO Ro 
°o fet OC ES 


— 
—— 


MSBs to Compare 


None 
15 
15-14 
15-13 
15-12 


Bank Size 
(16-Bit Words) 


64K 
32K 
16K 
8K 
4K 


Table 10-7. State of Signals When External Bus Interface is Disabled (EXIO = 1) 


Signal 
A(22-0) 
D(15-0) 
PS, DS, IS 


MSTRB, IOSTRB 


State 


Previous state 
High impedance 
High level 


High level 


Signal 
RW 
MSC 
1AQ 


State 
High level 


High level 


High level 


The EXIO and BH bits control the use of the external address and data buses. 
These bits should be set to 0 for normal operation. To reduce power dissipa- 
tion, especially if external memory is never or only infrequently accessed, 
EXIO and BH can be set to 1. 


When the EXIO bit in BSCR is set to 1, the CPU cannot modify the the HM bit 
in ST1 and cannot modify the memory map by changing the value of the 
DROM, MP/MC, and OVLY bits in PMST. 
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The ’54x has an internal register that contains the MSBs (as defined by the 
BNKCMP field) of the last address used for a read or write operation in pro- 
gram or data space. If the MSBs of the address used for the current read do 
not match those contained in this internal register, the MSTRB (memory 
strobe) signal is not asserted for one CLKOUT cycle. During this extra cycle, 
the address bus switches to the new address. The contents of the internal reg- 
ister are replaced with the MSBs for the read of the current address. If the 
MSBs of the address used for the current read match the bits in the register, 
a normal read cycle occurs. 


If repeated reads are performed from the same memory bank, no extra cycles 
are inserted. When a read is performed from a different memory bank, memory 
conflicts are avoided by inserting an extra cycle. An extra cycle is inserted only 
if a read memory access is followed by another read memory access. This 
feature can be disabled by clearing BNKCMP to 0. 


The ’54x bank-switching mechanism automatically inserts one extra cycle in 
the following cases: 


L1 A program-memory read followed by another program-memory or data- 
memory read from a different memory bank. 


_j A program-memory read followed by a data-memory read when the 
PS-DS bit is set to 1. 


Li Aprogram-memory read followed by another program-memory read from 
a different page (with the 548, 549, 5402, and ’5420). 


Lj A data-memory read followed by another program-memory or data- 
memory read from a different memory bank. 


Li A data-memory read followed by a program-memory read when the 
PS-DS bit is set to 1. 


Figure 10-6 illustrates the addition of an inactive cycle when memory banks 
are switched. 
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Figure 10-6. Bank Switching Between Memory Reads 


R/W / \ 
PS or DS \ / 


Figure 10-7 illustrates the insertion of the extra cycle between a consecutive 
program read and a data read. 


Figure 10-7. Bank Switching Between Program Space and Data Space 
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10.4 External Bus Interface Timing 


All external bus accesses complete in an integral number of CLKOUT cycles. 
One CLKOUT cycle is defined as the time from one falling edge to the next 
falling edge of CLKOUT. Some external bus accesses with no wait states, for 
example, memory writes, or I/O writes and I/O reads, take two cycles. Memory 
reads take one cycle; however, if a memory read follows a memory write, or 
vice versa, the memory read takes an additional half-cycle. The following sec- 
tions discuss zero wait-state accesses, unless otherwise specified. 


10.4.1 Memory Access Timing 
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The MSTRB signal is low for the active portion of reads and writes; an active 
portion of a memory access lasts (at least) one CLKOUT cycle. In addition, a 
transition CLKOUT cycle occurs before and after the active portion for writes. 
During this transition cycle: 


L) MSTRB is high. 


GV R/W changes on CLKOUT’s rising edge when required. 


_} The address changes on CLKOUT’s rising edge in the following cases. In 
all other instances, the address changes on the CLKOUT falling edge. 


The previous CLKOUT cycle was the active portion of amemory write. 
A memory read is followed by a memory write. 

A memory read is followed by an I/O write. 

A memory read is followed by an I/O read. 


PS, DS, or IS changes, if necessary, when the address changes. 


External Bus Interface Timing 


Figure 10-8 shows a read-read-write sequence with MSTRB active and no 
wait states. The data is read as late in the cycle as possible to allow for maxi- 
mum access time from a valid address. Although the external writes take two 
cycles, internally they require only one cycle if no accesses to the external 
interface are in progress. This helps maintain processing throughput at the 
maximum level possible. 


The timing diagram illustrates these concepts: 
41 Back-to-back reads from the same bank are single-cycle accesses. 


1 MSTRB stays low during back-to-back reads. 


_) MSTRB goes high for one cycle during read-to-write transitions to frame 
the address and R/W signal changes. 


Figure 10-8. Memory Interface Operation for Read-Read-Write 


address OK KKK 


R/W \ VA 
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Figure 10-9 shows a write-write-read sequence with MSTRB active and no 
wait states. The address and data written are held valid approximately one 
half-cycle after MSTRB changes. 


The timing diagram illustrates these concepts: 


1 MSTRB goes high at the end of every write cycle to disable the memory 
while the address and/or R/W signal changes. 


Lj Each write takes two cycles. 


11 A read following a write takes two cycles. 


Figure 10-9. Memory Interface Operation for Write-Write-Read 


a ee a Soa ap 
Data Write data Write data 


T Assuming that an I/O write preceded the first memory write 
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Figure 10-10 shows a read-read-write sequence using MSTRB active and 
one wait state. Because the reads are normally one cycle, they are extended 
by one additional cycle for the wait state. However, the write, which is already 
two cycles, is extended to three cycles. 


Figure 10-10. Memory Interface Operation for Read-Read-Write (Program-Space Wait 
States) 


Data Write data 
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10.4.2 I/O Access Timing 


Figure 10-11. 


In I/O accesses, the active portion of reads and writes lasts two cycles (with 
no wait states). Otherwise, the timing for these accesses is the same as for 
memory accesses. During these cycles, the address changes on the falling 
edge of CLKOUT, except for a memory access to I/O access sequence. 
IOSTRB is low from one rising edge to the next rising edge of the CLKOUT 
cycle. 


Figure 10-11 shows a read-write-read sequence for IOSTRB with no wait 
states. For IOSTRB accesses, reads and writes require a minimum of two 
cycles. Some off-chip peripherals can change their status bits during reads or 
writes; therefore, it is important that addresses remain valid when communi- 
cating with those peripherals. For reads and writes with IOSTRB active, 
IOSTRB is completely framed by the address to meet this requirement. 


The timing diagram illustrates these concepts: 


(1 Each I/O access takes two cycles. 


1) IOSTRB goes high at the end of each access to frame address and R/W 
signal changes. 


Parallel I/O Interface Operation for Read—Write—Read 


pares XK KK 
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Figure 10-12 shows the same I/O space access with one wait-state access. 
Each read and write access is extended by an additional cycle. 


Figure 10-12. Parallel I/O Operation for Read-Write-Read (l/O-Space Wait States) 
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10.4.3. Memory and I/O Access Timing 


Figure 10-13 through Figure 10-20 show the various transitions between 
memory reads and writes, and I/O reads and writes over the external interface 
bus. 


The timing diagrams illustrate these concepts: 


Lj l/Oreads and writes take at least three cycles when they follow a memory 
read or write. 


(1 Memory reads take two cycles when they follow an I/O read or write. 
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Figure 10-13. Memory Read and I/O Write 
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Figure 10-14. Memory Read and I/O Read 
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Figure 10-15. Memory Write and I/O Write 
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Figure 10-16. Memory Write and I/O Read 
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Figure 10-17. I/O Write and Memory Write 
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Figure 10-18. I/O Write and Memory Read 
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Figure 10-19. I/O Read and Memory Write 
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Figure 10-20. I/O Read and Memory Read 
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10.5 Start-Up Access Sequences 


10.5.1 Reset 
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The ’54x transitions between active and inactive states when entering or leav- 
ing one of four modes: IDLE1, IDLE2, reset, or IDLE3. 


Entering or leaving the first two modes, IDLE1 or IDLE2, requires no special 
consideration because the clocks to both the CPU and the on-chip peripherals 
remain active. However, special considerations are necessary when entering 
or leaving the other two modes: 


_j Reset. Hardware initialization takes place. 


_) IDLE3. The device makes atransition from a state where neither the CPU 
nor the on-chip peripherals are being clocked to an active state. 


Figure 10—21 shows the reset sequence of the external bus. For proper reset 
operation, the RS signal must be active for at least two CLKOUT cycles. How- 
ever, power-up and IDLE3 power-down mode require the reset signal to be 
active for more than two CLKOUT cycles. See section 6.11, Power-Down 
Modes, on page 6-50 for more detailed information. 


When the ’54x acknowledges a reset, the CPU terminates program execution 
and forces the program counter to FF80h. The address bus is driven with 
FF80h while RS is low. 


The device enters its reset state, in reference to the external bus, according 
to three steps: 


1) Fourcycles after RS is asserted low, PS, MSTRB, and IAQ are driven high. 


2) Five cycles after RS is asserted low, R/W is driven high, the data bus (if 
driven) goes into the high-impedance state, and the address bus is driven 
with OOFF80h. 


3) The device enters its reset state. 


When reset becomes inactive, program execution starts from the program 
memory location FF80h. The instruction acquisition signal (IAQ) and the inter- 
rupt acknowledge signal (IACK) become active, as shown in Figure 10-21, 
regardless of the state of the MP/MC signal. 


Start-Up Access Sequences 


The device enters its active state, in reference to the external bus, according 
to three steps: 


1) Five cycles after RS is asserted high, PS is driven low. 


2) Six cycles after RS is asserted high, MSTRB and IACK are driven low. 


3) One half-cycle later, the device is ready to read data and the device moves 
into its active state. 


Figure 10-21. External Bus Reset Sequence 
RS \ / 


Address Previous state FF80h 


Bank 
switching 


<-> 


Reset state-«————_> 


Notes: 1) RSis an asynchronous input and can be asserted at any point during a clock cycle. If the specified timings are met, 
the sequence shown occurs; otherwise, an additional delay of one clock cycle can occur. 


2) During reset, the data bus, is placed in high impedance and the control signals are de-asserted. 
3) The reset vector is fetched with seven wait states. 
4) The bank-switching cycle is inserted in the first access after reset. 
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10.5.2 IDLE3 


The execution of the IDLE 3 instruction initiates the IDLE3 power-down mode. 
In this power-down mode, the PLL is halted completely to reduce power 
consumption. In the IDLE mode, the input clock can be kept running without 
additional power consumption because a transfer gate inside the ’54x isolates 
the clock from the internal logic. The PLL must be restarted and locked before 
the 54x can resume processing when it exits IDLE3. This power-down mode 
is terminated by activating the external interrupt pins, INTn, NMI and RS, ina 
particular sequence. 


Table 10-8 shows the wake-up time of IDLE3 with the INTn and NMI signals. 
These times are defined for the hardware-configurable PLL. The times for the 
software programmable PLL are given in section 8.5.2. When an interrupt pin 
goes low, an internal counter counts the input clock cycles. The initial value 
loaded in the counter depends on the PLL multiplication factor to ensure the 
counter down-time is greater than 50 us for a 40 MIPS DSP. 


Table 10-8. Counter Down-Time With PLL Multiplication Factors at 40 MHz Operation 
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PLL Equivalent Counter 
Counter Multiplication CLKOUT Down-Time 
Start Value Factor Cycles (N) at 40 MHz (us) 
2048 1 2048 51.2 
2048 1.5 3072 76.8 
1024 2 2048 51.2 
1024 2.5 2560 64 
1024 3 3072 76.8 
512 4 2048 51.2 
512 4.5 2304 57.6 
512 5 2560 64 


The counter down-times in Table 10-8 are valid when the input clock frequen- 
cy is such that the CLKOUT frequency is 40 MHz when the PLL is locked. After 
the counter counts down to 0, the output from the locked PLL is fed to the inter- 
nal logic. 


Figure 10-22. 


Start-Up Access Sequences 


A low pulse (minimum duration of 10 ns) of an external interrupt causes the 
54x to wake up from IDLES3 (see Figure 10-22). The locked PLL clock is fed 
into the CPU after ncycles. An additional three cycles are needed before the 
54x comes out of IDLE3. However, the 54x does not need an extra two cycles 
for interrupt synchronization because the interrupt pulse initializes the inter- 
rupt synchronization, which is used to detect the interrupt immediately after the 
"54x wake-up. 


When reset is used to wake up from IDLE3, the counter is not used; the output 
from the PLL is immediately fed to the internal logic and the CLKOUT pin is 
asserted. The lock-up time is 50 us for the PLL and CLKOUT to be stable. 
Therefore, it is necessary to keep the reset line low during this 50-us lock-up 
time so that the ’54x does not start processing using an unstable clock. 


IDLE3 Wake-Up Sequence 
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10.6 Hold Mode 
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Two signals, HOLD and HOLDA, allow an external device to take control of the 
processor’s buses. The processor acknowledges receiving a HOLD signal 
from an external device by bringing HOLDA low. The 54x enters the hold mode 
and places its external address buses, data buses, and control signals into 
high impedance. 


The hold mode (HM) status bit, located in ST1, determines the following oper- 
ating modes for the hold function: 


[1 Normal mode suspends program execution during a low HOLD signal. 


[J Concurrent mode allows program execution to continue operating from in- 
ternal memory (ROM or RAM). 


When HM = 1, the 54x operates in the normal mode. When HM = 0, the 54x 
operates in the concurrent mode. In this mode, the ’54x enters the hold state 
only if program execution is from external memory or if an external-memory 
operand is being accessed. However, if program execution is from internal 
memory and no external memory operands are accessed, the ’54x enters the 
hold state externally but program execution continues internally. Thus, a pro- 
gram can continue executing while an external operation is performed. This 
makes the system operation more efficient. 


Program execution ceases until HOLD is removed if the ’54x is in a hold state 
with HM = 0 and an internally executing program requires an external access, 
or a branch to an external address. Also, if a repeat instruction that requires 
the use of the external bus is executing with HM = 0 when a hold occurs, the 
hold state is entered after the current bus cycle. If ahold occurs when a repeat 
instruction is executing with HM = 1, the 54x halts the execution after the 
current bus cycle, for either internal or external accesses. Upon reset, HM is 
cleared to 0. HM is set and reset by the SSBX and RSBx instructions, respec- 
tively. 


HOLD is not treated as an interrupt. The hold is accepted while executing the 
IDLE1 instruction regardless of the HM values. The hold is not accepted while 
executing the IDLE2 or IDLE instructions regardless of the HM value. If HOLD 
is received, the CPU continues to execute the IDLE instruction even though 
the external buses and the control signals are placed in high impedance. 


Figure 10-23 shows the timing for HOLD and HOLDA. If HOLD meets the set- 
up time before CLKOUT is low, a minimum of three machine cycles are needed 
before the buses and control signals go into high impedance. The HOLD is an 
external asynchronous input which is not latched. The external device must 
keep HOLD low. The external device can determine that the hold state has 
been entered when it receives a HOLDA signal from the 54x. 


Hold Mode 


If the ‘54x is in the middle of a multicycle instruction, it finishes the instruction 
before entering the hold state. After the instruction is completed, the buses are 
placed into high impedance. This also applies to instructions that become mul- 
ticycle because wait states are added. 


After HOLD is de-asserted, program execution resumes at the same instruc- 
tion from which it was halted. HOLDA is removed synchronously with HOLD, 
as shown in Figure 10—23. If the setup time is met, the processor requires two 
machine cycles (HM = 0) or three machine cycles (HM = 1) before the buses 
and control signals become valid. 


10.6.1 Interrupts During Hold 


All interrupts are disabled while HOLD is active with HM = 1. If an interrupt is 
received during this period, the interrupt is latched and remains pending; 
therefore, HOLD does not affect any interrupt flags or registers. When HM = 0, 
interrupts function normally. 


10.6.2 Hold and Reset 


lf HOLD is asserted while RS is active, normal reset operation occurs internal- 
ly, but all buses and control lines remain or become high impedance and 
HOLDA is asserted, as shown in Figure 10-24 (a) and (b). However, if RS is 
asserted while HOLD and HOLDA are active, the CPU is reset and the data 
bus, address bus, and control signals remain in high impedance, as shown in 
Figure 10-24 (c) and (d). 


If HOLD is de-asserted while RS is active, HOLDA is de-asserted normally in 
response, and the address, data, and control lines are driven according to the 
active reset state, as shown in Figure 10-24 (a) and (d). However, if RS is de- 
asserted while HOLD and HOLDA are active, the operation of the device de- 
pends on the state of the MP/MC pin. If MP/MC is high, the CPU begins fetch- 
ing the reset vector when the hold mode is exited. If MP/MC is low, the CPU 
fetches the reset vector internally and continues processing, unless it requires 
an external access before the hold mode is exited. Figure 10—24 (b) and (c) 
show examples of the case in which RS is de-asserted while HOLD and HOL- 
DA are active. 
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Figure 10-23. HOLD and HOLDA Minimum Timing for HM = 0 
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Notes: 1) The timing shows the hold mode when HM = 0. When HM = 1, another cycle is required before HOLDA becomes 
inactive. 


2) The first cycle after releasing the hold mode is a cycle of bank switching. 
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Figure 10-24. HOLD and RS Interaction 


(a) Hold is Asserted While Reset is Active and De-asserted While Reset is Active 
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Figure 10-24. HOLD and R&S Interaction (Continued) 


(c) Hold is Asserted and De-asserted While Reset is Inactive. 
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(d) Hold is Asserted While Reset is Inactive and De-asserted While Reset is Active 
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Design Considerations for 
Using XDS510 Emulator 


This appendix assists you in meeting the design requirements of the Texas 
Instruments XDS510 emulator with respect to IEEE-1149.1 designs and 
discusses the XDS510 cable (manufacturing part number 2617698-0001). 
This cable is identified by a label on the cable pod marked JTAG 3/5V and sup- 
ports both standard 3-V and 5-V target system power inputs. 


The term JTAG, as used in this book, refers to Tl scan-based emulation, which 
is based on the IEEE 1149.1 standard. 


For more information concerning the IEEE 1149.1 standard, contact IEEE 
Customer Service: 


Address: IEEE Customer Service 
445 Hoes Lane, PO Box 1331 
Piscataway, NJ 08855-1331 


Phone: (800) 678—IEEE in the US and Canada 
(908) 981-1393 outside the US and Canada 


FAX: (908) 981-9667 Telex: 833233 
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Designing Your Target System’s Emulator Connector (14-Pin Header) 


A.1 Designing Your Target System’s Emulator Connector (14-Pin Header) 


JTAG target devices support emulation through a dedicated emulation port. 
This port is accessed directly by the emulator and provides emulation func- 
tions that are a superset of those specified by IEEE 1149.1. To communicate 
with the emulator, your target system must have a 14-pin header (two rows of 
seven pins) with the connections that are shown in Figure A—1. Table A—1 
describes the emulation signals. 


Although you can use other headers, the recommended unshrouded, straight 
header has these DuPont connector systems part numbers: 


Cc) 65610-114 
C) 65611-114 
—) 67996-114 
—) 67997-114 


Figure A-1. 14-Pin Header Signals and Header Dimensions933 


TMS TRST 
i oe Pinto-pin spacing, 0.100 in. (X,Y) 
PD (Vcc) no pin (key)t Pin width, 0.025-in. square post 
TDO GND Pin length, 0.235-in. nominal 
TCK_RET GND 
TCK GND 


EMUO EMU1 


Tt While the corresponding female position on the cable connector is plugged to prevent improper 
connection, the cable lead for pin 6 is present in the cable and is grounded, as shown in the 
schematics and wiring diagrams in this appendix. 
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Table A-1. 14-Pin Header Signal Descriptions 


Emulatort Targett 


Signal Description State State 
EMUO Emulation pin 0 | 0 
EMU1 Emulation pin 1 | 1/0 
GND Ground 

PD(Vcc) Presence detect. Indicates that the emulation O 


cable is connected and that the target is 
powered up. PD should be tied to Vcc in the 
target system. 


TCK Test clock. TCK is a 10.368-MHz clock oO 
source from the emulation cable pod. This 
signal can be used to drive the system test 


clock. 
TCK_RET  Testclock return. Test clock input to the emu- O 
lator. May be a buffered or unbuffered version 
of TCK. 
TDI Test data input O | 
TDO Test data output | O 
TMS Test mode select O | 
TRSTF Test reset O | 


T | = input; O = output 

+Do not use pullup resistors on TRST: it has an internal pulldown device. In a low-noise 
environment, TRST can be left floating. In a high-noise environment, an additional pulldown 
resistor may be needed. (The size of this resistor should be based on electrical current 
considerations.) 
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A.2 Bus Protocol 
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The IEEE 1149.1 specification covers the requirements for the test access port 
(TAP) bus slave devices and provides certain rules, summarized as follows: 


[1 The TMS and TDI inputs are sampled on the rising edge of the TCK signal 
of the device. 


[1 The TDO output is clocked from the falling edge of the TCK signal of the 
device. 


When these devices are daisy-chained together, the TDO of one device has 
approximately a half TCK cycle setup time before the next device’s TDI signal. 
This timing scheme minimizes race conditions that would occur if both TDO 
and TDI were timed from the same TCK edge. The penalty for this timing 
scheme is a reduced TCK frequency. 


The IEEE 1149.1 specification does not provide rules for bus master (emula- 
tor) devices. Instead, it states that the device expects a bus master to provide 
bus slave compatible timings. The XDS510 provides timings that meet the bus 
slave rules. 


Emulator Cable Pod 


A.3 Emulator Cable Pod 


Figure A-2 shows a portion of the emulator cable pod. The functional features 
of the pod are: 


(1 TDO and TCK_RET can be parallel-terminated inside the pod if required 
by the application. By default, these signals are not terminated. 


(1 TCKisdriven with a 74LVT240 device. Because of the high-current drive 
(32-mA Io /Ioy), this signal can be parallel-terminated. If TCK is tied to 
TCK_RET, you can use the parallel terminator in the pod. 


[1 TMS and TDIcan be generated from the falling edge of TCK_RET, accord- 
ing to the IEEE 1149.1 bus slave device timing rules. 


Li TMS and TDI are series-terminated to reduce signal reflections. 


Lj A 10.368-MHz test clock source is provided. You can also provide your 
own test clock for greater flexibility. 


Figure A—2. Emulator Cable Pod Interface 
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Tt The emulator pod uses TCK_RET as its clock source for internal synchronization. TCK is provided as an 
optional target system test clock source. 
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A.4 Emulator Cable Pod Signal Timing 


Figure A-3 shows the signal timings for the emulator cable pod. Table A—2 
defines the timing parameters illustrated in the figure. These timing parame- 
ters are calculated from values specified in the standard data sheets for the 
emulator and cable pod and are for reference only. Texas Instruments does 
not test or guarantee these timings. 


The emulator pod uses TCK_RET as its clock source for internal synchroni- 
zation. TCK is provided as an optional target system test clock source. 


Figure A-3. Emulator Cable Pod Timings 
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Table A—2. Emulator Cable Pod Timing Parameters 


No. Parameter Description Min Max Unit 
1 te(TCK) Cycle time, TCK_RET 35 200 ns 
2 tw(TCKH) Pulse duration, TCK_RET high 15 ns 
3 tw(TCKL) Pulse duration, TCK_RET low 15 ns 
4 ta(TMS) Delay time, TMS or TDI valid for TCK_RET low 6 20 ns 
5 tsu(TDO) Setup time, TDO to TCK_RET high 3 ns 
6 th(TDO) Hold time, TDO from TCK_RET high 12 ns 


Emulation Timing Calculations 


A.5 Emulation Timing Calculations 


Example A-1 and Example A-2 help you calculate emulation timings in your 
system. For actual target timing parameters, see the appropriate data sheet 
for the device you are emulating. 

The examples use the following assumptions: 


tsu(TTMS) Setup time, target TMS or TDI to TCK 


high 10 ns 
td(TTDO) Delay time, target TDO from TCK low 15 ns 
td(bufmax) Delay time, target buffer maximum 10 ns 
td(bufmin) Delay time, target buffer minimum ins 
toufskew Skew time, target buffer between two de- 1.35 ns 


vices in the same package: 
[td(bufmax) — ta(bufmin)] x 0.15 


tTcKfactor Duty cycle, assume a 40/60% duty cycle 0.4 
clock (40%) 
Also, the examples use the following values from Table A—2 on page A-6: 
ta(TMSmax) _ Delay time, emulator TMS or TDI from 20 ns 
TCK_RET low, maximum 
tsu(TDOmin) Setup time, TDO to emulator TCK_RET 3 ns 


high, minimum 
There are two key timing paths to consider in the emulation design: 


L} TheTCK_RET-to-TMSorTDIpath, calledtpg/tcK_RET-TMS/TD)) (Propaga- 
tion delay time) 


L} The TCK_RET-to-TDO path, called tog/TckK_RET-TDO) 


In the examples, the worst-case path delay is calculated to determine the 
maximum system test clock frequency. 
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Example A—1. Key Timing for a Single-Processor System Without Buffers 


E (TMSmax) + su rrrus)| 


t => 
pd (TCK_RET-TMS/TDI) tcktactor 


_ (20 ns + 10 ns) 
0.4 
= 75 ns, or 13.3 MHz 


E (TTD0) * tsu rroomin| 


t 7 — 
pd (TCK_RET—TDO) troktactor 


_ (15 ns + 3 ns) 
0.4 
= 45 ns, or 22.2 MHz 


In this case, because the TCK_RET-to-TMS/TDI path requires more time to 
complete, it is the limiting factor. 


Example A—2. Key Timing for a Single- or Multiple-Processor System With Buffered Input 
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and Output 


ty (TMSmax) a tou (TTMS) + at | 


t 7 = 
pd (TCK_RET—TMS/TDI) trektactor 


(20 ns + 10 ns + 2(10) ns) 
0.4 


= 54 ns, or 18.5 MHz 


ty (TTDO) + tsuctTDOminy + ta tbutskew| 


t 2 — 
pd (TCK_RET—TDO) emcee 


(15 ns + 3 ns + 1.35 ns) 
0.4 


= 58.4 ns, or 20.7 MHz 


In this case also, because the TCK_RET-to-TMS/TDI path requires more time 
to complete, it is the limiting factor. 


Emulation Timing Calculations 


In a multiprocessor application, it is necessary to ensure that the EMUO and 
EMU1 lines can go from a logic low level to a logic high level in less than 10 
us, this parameter is called rise time, t,. This can be calculated as follows: 


tr = 5(Roullup X Ndevices * Cload_per_device) 
= 5(4.7 kQ x 16 x 15 pF) 
= 5(4.7x 103 Qx 16x 15 =no-12F) 
= 5(1128x 10-9) 
= 5.64 us 
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A.6 Connections Between the Emulator and the Target System 


A.6.1 


It is extremely important to provide high-quality signals between the emulator 
and the JTAG target system. You must supply the correct signal buffering, test 
clock inputs, and multiple processor interconnections to ensure proper emula- 
tor and target system operation. 


Signals applied to the EMUO and EMU1 pins on the JTAG target device can 
be either input or output. In general, these two pins are used as both input and 
output in multiprocessor systems to handle global run/stop operations. EMU0O 
and EMU1 signals are applied only as inputs to the XDS510 emulator header. 


Buffering Signals 


If the distance between the emulation header and the JTAG target device is 
greater than 6 inches, the emulation signals must be buffered. If the distance 
is less than 6 inches, no buffering is necessary. Figure A—4 shows the simpler, 
no-buffering situation. 


The distance between the header and the JTAG target device must be no more 
than 6 inches. The EMUO and EMU1 signals must have pullup resistors con- 
nected to Vcc to provide a signal rise time of less than 10 us. A 4.7-kQ resistor 
is suggested for most applications. 


Figure A-4. Emulator Connections Without Signal Buffering 
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= 6 inches or less | 


Voc 


Voc 
JTAG device Emulator header 4 
EMUO EMUO PD 
EMU1 e EMU1 
TRST TRST 
TMS TMS 
TDI TDI 
TDO TDO 
TCK i TCK 
TCK_RET V 


GND 


Figure A—-5 shows the connections necessary for buffered transmission sig- 
nals. The distance between the emulation header and the processor is greater 
than 6 inches. Emulation signals TMS, TDI, TDO, and TCK_RET are buffered 
through the same device package. 


Connections Between the Emulator and the Target System 


Figure A-5. Emulator Connections With Signal Buffering 


Greater than 
6 inches 
Voc Voc 
JTAG device Emulator header 4 
EMUO EMUO PD 
EMU1 i EMU1 
TRST TAST 
TMS <] e TMS 
TDI @ TDI 
TDO > TDO 
TCK TCK 
> TCK_RET Vv 
GND 


The EMUO and EMU1 signals must have pullup resistors connected to Vcc to 
provide a signal rise time of less than 10 us. A 4.7-kQ resistor is suggested for 
most applications. 


The input buffers for TMS and TDI should have pullup resistors connected to 
Vcc to hold these signals at a known value when the emulator is not con- 
nected. A resistor value of 4.7 kQ or greater is suggested. 


To have high-quality signals (especially the processor TCK and the emulator 
TCK_RET signals), you may have to employ special care when routing the 
printed wiring board trace. You also may have to use termination resistors to 
match the trace impedance. The emulator pod provides optional internal paral- 
lel terminators on the TCK_RET and TDO. TMS and TDI provide fixed series 
termination. 


Because TRST is an asynchronous signal, it should be buffered as needed to 
ensure sufficient current to all target devices. 
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A.6.2 Using a Target-System Clock 


Figure A-6 shows an application with the system test clock generated in the 
target system. In this application, the emulator’s TCK signal is left uncon- 
nected. 


Figure A-6. Target-System-Generated Test Clock 


Greater than 
6 inches 
Vcc 
JTAG device Emulator header 
EMUO EMUO 
EMU1 i EMU1 
TRST TRST 
<] e TS 
TDI e TDI 
TDO > TDO 
TCK NC TCK 


> TCK_RET 


System test clock 


Note: Whenthe TMS and TDI lines are buffered, pullup resistors must be used to hold the buffer 
inputs at a known level when the emulator cable is not connected. 


There are two benefits in generating the test clock in the target system: 


(J The emulator provides only a single 10.368-MHz test clock. If you allow 
the target system to generate your test clock, you can set the frequency 
to match your system requirements. 


[1 In some cases, you may have other devices in your system that require 
a test clock when the emulator is not connected. The system test clock 
also serves this purpose. 
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A.6.3 Configuring Multiple Processors 


Figure A—7 shows a typical daisy-chained multiprocessor configuration that 
meets the minimum requirements of the IEEE 1149.1 specification. The 
emulation signals are buffered to isolate the processors from the emulator and 
provide adequate signal drive for the target system. One of the benefits of this 
interface is that you can slow down the test clock to eliminate timing problems. 
Follow these guidelines for multiprocessor support: 


Li The processor TMS, TDI, TDO, and TCK signals must be buffered through 
the same physical device package for better control of timing skew. 


1 The input buffers for TMS, TDI, and TCK should have pullup resistors con- 
nected to Vcc to hold these signals at a known value when the emulator 
is not connected. A resistor value of 4.7 kQ or greater is suggested. 


(1 Buffering EMUO and EMU1 is optional but highly recommended to provide 
isolation. These are not critical signals and do not have to be buffered 
through the same physical package as TMS, TCK, TDI, and TDO. 


Figure A—7. Multiprocessor Connections 


JTAG device JTAG device 


VCC 
Emulator header 


EMUO 


EMU1 


TRST 


TMS 


TDI 


TDO 


TCK 


TCK_RET V 
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A.7 Physical Dimensions for the 14-Pin Emulator Connector 


The JTAG emulator target cable consists of a 3-foot section of jacketed cable 
that connects to the emulator, an active cable pod, and a short section of jack- 
eted cable that connects to the target system. The overall cable length is 
approximately 3 feet 10 inches. Figure A—-8 and Figure A—9 (page A-15) show 
the physical dimensions for the target cable pod and short cable. The cable 
pod box is nonconductive plastic with four recessed metal screws. 


Figure A—8. Pod/Connector Dimensions 


2.70 in., nominal 


4.50 in., nominal 


& 9.50 in., nominal 
@ 


— 0.90 in., 
nominal 


Emulator cable pod Ss Connector 


EES 
Short, jacketed cable it 


See Figure A-9 


Note: All dimensions are in inches and are nominal dimensions, unless otherwise specified. Pin-to-pin spacing on the connec- 
tor is 0.100 inches in both the X and Y planes. 
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Physical Dimensions for the 14-Pin Emulator Connector 
Figure A-9. 14-Pin Connector Dimensions 
> — se 0.20 i nch, 
nominal 


Cable 


0.66 inch, 
nominal 


YY, 


Connector, side view 


0.100 inch, 
nominal 
(pin spacing) 


Key, pin 6 


A 
0.87 inch, 
Cable t nominal 
0.100 inch, 
nominal 
(pin spacing) 


y 


Connector, front view 


] 
2 rows of pins 
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A.8 Emulation Design Considerations 


This section describes the use and application of the scan path linker (SPL), 
which can simultaneously add all four secondary JTAG scan paths to the main 
scan path. It also describes the use of the emulation pins and the configuration 
of multiple processors. 


A.8.1_ Using Scan Path Linkers 
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You can use the Tl ACT8997 scan path linker (SPL) to divide the JTAG 
emulation scan path into smaller, logically connected groups of 4 to 16 
devices. As described in the Advanced Logic and Bus Interface Logic Data 
Book, the SPL is compatible with the JTAG emulation scanning. The SPL is 
capable of adding any combination of its four secondary scan paths into the 
main scan path. 


A system of multiple, secondary JTAG scan paths has better fault tolerance 
and isolation than a single scan path. Since an SPL has the capability of adding 
all secondary scan paths to the main scan path simultaneously, it can support 
global emulation operations, such as starting or stopping a selected group of 
processors. 


Tl emulators do not support the nesting of SPLs (for example, an SPL 
connected to the secondary scan path of another SPL). However, you can 
have multiple SPLs on the main scan path. 


Scan path selectors are not supported by this emulation system. The TI 
ACT8999 scan path selector is similar to the SPL, but it can add only one of 
its secondary scan paths at a time to the main JTAG scan path. Thus, global 
emulation operations are not assured with the scan path selector. 


You can insert an SPL on a backplane so that you can add up to four device 
boards to the system without the jumper wiring required with nonbackplane 
devices. You connect an SPL to the main JTAG scan path in the same way you 
connect any other device. Figure A-10 shows how to connect a secondary 
scan path to an SPL. 
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Figure A—10. Connecting a Secondary JTAG Scan Path to a Scan Path Linker 


SPL 
DTCK Tp) YTAGO 
TDI | DTDOO TS 
¢ TMS | DTMSO ¢ TCK 
4 TCK | DTDIO ¢ TRST 
TRST (oepo $ TDO 


me DTMS1 


DTDI1 


DTDO2 
DTMS2 
DTDI2 


DTDO3 
DTMS3 
DTDI3 


The TRST signal from the main scan path drives all devices, even those on 
the secondary scan paths of the SPL. The TCK signal on each target device 
onthe secondary scan path of an SPL is driven by the SPL’s DTCK signal. The 
TMS signal on each device on the secondary scan path is driven by the respec- 
tive DTMS signals on the SPL. 


DTDOO onthe SPL is connected to the TDI signal of the first device on the sec- 
ondary scan path. DTDIO on the SPL is connected to the TDO signal of the last 
device in the secondary scan path. Within each secondary scan path, the TDI 
signal of a device is connected to the TDO signal of the device before it. If the 
SPLis ona backplane, its secondary JTAG scan paths are on add-on boards; 
if signal degradation is a problem, you may need to buffer both the TRST and 
DTCK signals. Although degradation is less likely for DTMSn signals, you may 
also need to buffer them for the same reasons. 
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A.8.2 Emulation Timing Calculations for a Scan Path Linker (SPL) 


Example A-3 and Example A-4 help you to calculate the key emulation tim- 
ings in the SPL secondary scan path of your system. For actual target timing 
parameters, see the appropriate device data sheet for your target device. 


The examples use the following assumptions: 


tsu(TTMS) Setup time, target TMS/TDI to TCK high 10ns 
ta(TTDO) Delay time, target TDO from TCK low 15 ns 
td(bufmax) Delay time, target buffer, maximum 10 ns 
td(bufmin) Delay time, target buffer, minimum Ins 
t(bufskew) Skew time, target buffer, between two 1.35 ns 


devices in the same package: 
[td(bufmax) — td(bufmin)] x 0.15 


t(TCKfactor) Duty cycle, TCK assume a 40/60% clock res 


Also, the examples use the following values from the SPL data sheet: 


ta(DTMSmax) Delay time, SPL DTMS/DTDO from TCK 31 ns 
low, maximum 

tsu(DTDLmin) Setup time, DTDI to SPL TCK high, 7 ns 
minimum 

td(DTCKHmin) Delay time, SPL DTCK from TCK high, 2ns 
minimum 

ta(DTCKLmax) Delay time, SPL DTCK from TCK low, 16 ns 
maximum 


There are two key timing paths to consider in the emulation design: 


L) The TCK-to-DTMS/DTDO path, called tog(TcK-DTMS) 


LJ) The TCK-to-DTDI path, called tog(TcK-DTDI) 
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Of the following two cases, the worst-case path delay is calculated to deter- 
mine the maximum system test clock frequency. 


Example A-3. Key Timing for a Single-Processor System Without Buffering (SPL) 


+ ty 


tg(oTMSmax) + ta(oTCKHmin) + su rrmus)| 


t — 
pare EMS) ttcktactor 


(31 ns + 2 ns + 10 ns) 
0.4 


= 107.5 ns, or 9.38 MHz 


+t 


ty (TTDO) = ty (DTCKLmax) § ‘su oro) 


tod (TCK-DTDI) t 


TCKfactor 


(15 ns + 16 ns + 7 ns) 
0.4 


= 9.5 ns, or 10.5 MHz 


In this case, the TCK-to-DTMS/DTDL path is the limiting factor. 


Example A-4. Key Timing for a Single- or Multiorocessor-System With Buffered Input 
and Output (SPL) 


ta(oTMSmax) + '(oTcKHmin) + 'sucttms) + tutskow 


t : = 
pare TRM=) ttcktactor 


(31 ns + 2 ns + 10 ns + 1.35 ns) 
0.4 


= 110.9 ns, or 9.0 MHz 


te (TTD0) * ty(otcKLmax) * tsu(oTDLmin) + tg tosshow | 


t = — 
peer eee) ttcktactor 


(15 ns + 15 ns + 7 ns + 10 ns) 
0.4 


= 120 ns, or 8.38 MHz 


In this case, the TCK-to-DTDI path is the limiting factor. 


Design Considerations for Using XDS510 Emulator A-19 


Emulation Design Considerations 


A.8.3 Using Emulation Pins 
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The EMU0/1 pins of TI devices are bidirectional, 3-state output pins. When in 
an inactive state, these pins are at high impedance. When the pins are active, 
they provide one of two types of output: 


(1 Signal Event. The EMU0/1 pins can be configured via software to signal 
internal events. In this mode, driving one of these pins low can cause 
devices to signal such events. To enable this operation, the EMU0/1 pins 
function as open-collector sources. External devices such as logic analyz- 
ers can also be connected to the EMU0/1 signals in this manner. If such 
an external source is used, it must also be connected via an open-collector 
source. 


(7 External Count. The EMU0/1 pins can be configured via software as 
totem-pole outputs for driving an external counter. If the output of more 
than one device is configured for totem-pole operation, then these devices 
can be damaged. The emulation software detects and prevents this condi- 
tion. However, the emulation software has no control over external 
sources on the EMU0/1 signal. Therefore, all external sources must be 
inactive when any device is in the external count mode. 


TI devices can be configured by software to halt processing if their EMUO/1 
pins are driven low. This feature combined with the signal event output, allows 
one TI device to halt all other Tl devices on a given event for system-level de- 


bugging. 


If you route the EMU0/1 signals between multiple boards, they require special 
handling because they are more complex than normal emulation signals. 
Figure A-11 shows an example configuration that allows any processor in the 
system to stop any other processor in the system. Do not tie the EMU0/1 pins 
of more than 16 processors together in a single group without using buffers. 
Buffers provide the crisp signals that are required during a RUNB (run bench- 
mark) debugger command or when the external analysis counter feature is 
used. 


Emulation Design Considerations 


Figure A-11. EMUO0/1 Configuration to Meet Timing Requirements of Less Than 25 ns 


fl re Target board 1_ 
| | 
Pullup | 
| Open- resistor | 
| collector 4 ed EMUO/1 | 
| drivers | 
Backplane | <] Device] _—| Device | 
XCNT_ENABLE | 1 n | 
—— 4 
EMU0/1-IN 
° 
i Pullup 
resistor | | --———-—-—-— - 
EMUO/1-OUT 4 I Target board m | 
| | 
TCK To emulator EMUO | | 
| Pullup | 
| Open- resistor | 
| collector ¢ = ° EMUO/1 | 
| drivers | 
| <] Device] __—‘| Device | 
| l a | 
Ep a se ee a 


Notes: 1) Thelowtime on EMU0/1-IN should be at least one TCK cycle and less than 10 us. Software sets the EMU0/1-OUT 
pin to a high state. 


2) To enable the open-collector driver and pullup resistor on EMU1 to provide rise/fall times of less than 25 ns, the modifi- 
cation shown in this figure is suggested. Rise times of more than 25 ns can cause the emulator to detect false edges 
during the RUNB command or when the external counter selected from the debugger analysis menu is used. 


These seven important points apply to the circuitry shown in Figure A-11 and 
the timing shown in Figure A-12: 


[J Open-collector drivers isolate each board. The EMU0/1 pins are tied 
together on each board. 


1 Atthe board edge, the EMU0/1 signals are split to provide both input and 
output connections. This is required to prevent the open-collector drivers 
from acting as latches that can be set only once. 


[J The EMU0/1 signals are bused down the backplane. Pullup resistors must 
be installed as required. 
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(J The bused EMU0/1 signals go into a programmable logic array device 


PAL® whose function is to generate a low pulse on the EMU0/1-IN signal 
when a low level is detected on the EMU0/1-OUT signal. This pulse must 
be longer than one TCK period to affect the devices but less than 10 us 
to avoid possible conflicts or retriggering once the emulation software 
clears the device’s pins. 


During a RUNB debugger command or other external analysis count, the 
EMU0/1 pins on the target device become totem-pole outputs. The EMU1 
pin is a ripple carry-out of the internal counter. EMU0 becomes a proces- 
sor-halted signal. During a RUNB or other external analysis count, the 
EMU0/1-IN signal to all boards must remain in the high (disabled) state. 
You must provide some type of external input (XCNT_ENABLE) to the 
PAL® to disable the PAL® from driving EMUO/1-IN to a low state. 


If you use sources other than TI processors (such as logic analyzers) to 
drive EMU0/1, their signal lines must be isolated by open-collector drivers 
and be inactive during RUNB and other external analysis counts. 


You must connect the EMU0/1-OUT signals to the emulation header or 
directly to a test bus controller. 


Figure A-12. Suggested Timings for the EMU0 and EMU17 Signals 
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me NF YF VS 7 
EMU0/1-OUT SRL 
EMUO0/1-IN NON / 
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Figure A—13. EMUO0/1 Configuration With Additional AND Gate to Meet Timing 
Requirements of Greater Than 25 ns 


1 ee) ee ee a Target board 1 
| | 
Pullup | 
| Open- resistor | 
| collector ¢ “sie EMUO/1 | 
| drivers | 
Backplane +<|— Device] _—[ Device | 
XCNT_ENABLE 1 n | 
| | 
Ma a i a 4 

EMU0/1-IN 

9 
PAL 
Pullup 

EMUO0/1-OUT < resistor [ey ee a ee a 


+ | 
| 
| 

TCK To Emulator EMUO Pullup 
| 
| 


| 

| 

| 

| 

Open- resistor | 
| 

| 

| 

| 


Circuitry required for >25-ns collector ¢ ae EMUO/1 
rise/fall time modification | drivers 


ere ns eee : 


iw) 
oO 
=< 
ie) 
oO 
iw) 
oO 
a< 
[?) 
oO 


To emulator EMU1 


| | 
| | 
| | 
7 : EMU1 signal from other boards 
| | 
i 


Notes: 1) Thelowtime on EMU0/1-IN should be at least one TCK cycle and less than 10 us. Software will set the EMU0/1-OUT 
port to a high state. 


2) To enable the open-collector driver and pullup resistor on EMU1 to provide rise/fall time of greater than 25 ns, the 
modification shown in this figure is suggested. Rise times of more than 25 ns can cause the emulator to detect false 
edges during the RUNB command or when the external counter selected from the debugger analysis menu is used. 
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You do not need to have devices on one target board stop devices on another 
target board using the EMU0/1 signals (see the circuit in Figure A-14). In this 
configuration, the global-stop capability is lost. It is important not to overload 
EMUO0/1 with more than 16 devices. 


Figure A—14. EMUO0/1 Configuration Without Global Stop 


Note: 


I 7 
fi Target board 1 | 
| 
| Pullup | 
l resistor | 
EMUO/1 : 
Pullup | 

resistor | a : 

n 
To emulator r) | | 
EMUO0/1 l | 
a ce ee ee J 
CS SS Se aq 
| Target boardm | 
| | 
| | 
| Pullup | 
| resistor | 
| ° ne © EMUO/1 | 
| 

n 
| | 
Me eS, | 


The open-collector driver and pullup resistor on EMU1 mustbe able to provide rise/fall times of less than 25 ns. Rise times 
of more than 25 ns can cause the emulator to detect false edges during the RUNB command or when the external counter 
selected from the debugger analysis menu is used. If this condition cannot be met, then the EMU0/1 signals from the 
individual boards must be ANDed together (as shown in Figure A-14) to produce an EMU0/1 signal for the emulator. 


A.8.4_ Performing Diagnostic Applications 
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For systems that require built-in diagnostics, it is possible to connect the 
emulation scan path directly to a TI ACT8990 test bus controller (TBC) instead 
of the emulation header. The TBC is described in the Texas Instruments 
Advanced Logic and Bus Interface Logic Data Book. Figure A-15 shows the 
scan path connections of n devices to the TBC. 
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Figure A—15. TBC Emulation Connections for n JTAG Scan Paths 


4 


TBC TCKI 
TDO Tp! YTAGO 
TMSO ° TMS 
TMS1 EMUO 
TMS2/EVNTO + EMU1 
TMS3/EVNT1 é TRST 
TMS4/EVNT2 TCK 
TMS5/EVNT3 a TDO 
TCKO : 
TDIO : 
TDI Tp! YTAGN 
TMS 
+ EMUO 
4 EMU1 
4 TRST 
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In the system design shown in Figure A-15, the TBC emulation signals TCKI, 
TDO, TMSO, TMS2/EVNTO, TMS3/EVNT1, TMS5/EVNT3, TCKO, and TDIO 
are used, and TMS1, TMS4/EVNT2, and TDI1 are not connected. The target 
devices’ EMU0 and EMU1 signals are connected to Vcc through pullup resis- 
tors and tied to the TBC’s TMS2/EVNT0 and TMS3/EVNT1 pins, respectively. 
The TBC’s TCKI pin is connected to a clock generator. The TCK signal for the 
main JTAG scan path is driven by the TBC’s TCKO pin. 


On the TBC, the TMSO pin drives the TMS pins on each device on the main 
JTAG scan path. TDO on the TBC connects to TDI on the first device on the 
main JTAG scan path. TDIO on the TBC is connected to the TDO signal of the 
last device on the main JTAG scan path. Within the main JTAG scan path, the 
TDI signal of a device is connected to the TDO signal of the device before it. 
TRST for the devices can be generated either by inverting the TBC’s 
TMS5/EVNTS signal for software control or by logic on the board itself. 
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Development Support and Part Order Information 


This appendix provides development support information, device part num- 
bers, and support tool ordering information for the ’54x. 


Each ’54x support product is described in the TMS320 DSP Development 
Support Reference Guide. In addition, more than 100 third-party developers 
offer products that support the Tl TMS320 family. For more information, refer 
to the TMS320 Third-Party Support Reference Guide. 


For information on pricing and availability, contact the nearest Tl Field Sales 
Office or authorized distributor. See the list at the back of this book. 
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B.1 Development Support 


This section describes the development support provided by Texas Instru- 
ments. 


B.1.1 Development Tools 


TI offers an extensive line of development tools for the ‘54x generation of 
DSPs, including tools to evaluate the performance of the processors, generate 
code, develop algorithm implementations, and fully integrate and debug soft- 
ware and hardware modules. 


Code Generation Tools 


(i ~The optimizing ANSI C compiler translates ANSI C language directly into 
highly optimized assembly code. You can then assemble and link this code 
with the Tl assembler/linker, which is shipped with the compiler. This prod- 
uct is currently available for PCs (DOS, DOS extended memory, OS/2), 
HP workstations, and SPARC workstations. See the TMS320C54x Opti- 
mizing C Compiler User’s Guide for detailed information about this tool. 


(J The assembler/linker converts source mnemonics to executable object 
code. This product is currently available for PCs (DOS, DOS extended 
memory, OS/2). The ‘54x assembler for HP and SPARC workstations is 
available only as part of the optimizing ‘54x compiler. See the 
TMS320C54x Assembly Language Tools User’s Guide for detailed in- 
formation about available assembly-language tools. 


System Integration and Debug Tools 


(J The simulator simulates (via software) the operation of the ’54x and can 
be used in C and assembly software development. This product is current- 
ly available for PCs (DOS, Windows), HP workstations, and SPARC 
workstations. See the TMS320C54x C Source Debugger User’s Guide for 
detailed information about the debugger. 


(i The XDS510 emulator performs full-speed in-circuit emulation with the 
54x, providing access to all registers as well as to internal and external 
memory of the device. It can be used in C and assembly software develop- 
ment and has the capability to debug multiple processors. This product is 
currently available for PCs (DOS, Windows, OS/2), HP workstations, and 
SPARC workstations. This product includes the emulator board (emulator 
box, power supply, and SCSI connector cables in the HP and SPARC ver- 
sions), the 54x C source debugger and the JTAG cable. 
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Because the ’C2xx, 'C3x, 'C4x, and ’C5x XDS510 emulators also come 
with the same emulator board (or box) as the 54x, you can buy the 54x C 
Source Debugger Software as a separate product called the ‘54x C 
Source Debugger Conversion Software. This enables you to debug ’54x 
applications with a previously purchased emulator board. The emulator 
cable that comes with the ’C3x XDS510 emulator cannot be used with the 
54x. You need the JTAG emulation conversion cable (see section B.2) 
instead. The emulator cable that comes with the ’C5x XDS510 emulator 
can be used with the ’54x without any restriction. See the TMS320C54x C 
Source Debugger User's Guide) for detailed information about the ’54x 
emulator. 


Li) The TMS320C54x evaluation module (EVM) is a PC/AT plug-in card that 
lets you evaluate certain characteristics of the ‘54x digital signal processor 
to see if it meets your application requirements. The 54x EVM carries a 
541 DSP on board to allow full-speed verification of 54x code. The EVM 
has 5K bytes of on-chip program/data RAM, 28K bytes of on-chip ROM, 
two serial ports, a timer, access to 64K bytes each of external program and 
data RAM, and an external analog interface for evaluation of the ’54x fami- 
ly of devices for applications. See the TMS320C54x Evaluation Module 
Technical Reference for detailed information about the ‘54x EVM. 


B.1.2 Third-Party Support 


The TMS320 family is supported by products and services from more than 100 
independent third-party vendors and consultants. These support products 
take various forms (both as software and hardware), from cross-assemblers, 
simulators, and DSP utility packages to logic analyzers and emulators. The 
expertise of those involved in support services ranges from speech encoding 
and vector quantization to software/hardware design and system analysis. 


To ask about third-party services, products, applications, and algorithm devel- 
opment packages, contact the third party directly. Refer to the TMS320 Thira- 
Party Support Reference Guide for addresses and phone numbers. 
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B.1.3. Technical Training Organization (TTO) TMS320 Workshops 


B.1.4 Assistance 
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"54x DSP Design Workshop. This workshop is tailored for hardware and soft- 
ware design engineers and decision-makers who will be designing and utiliz- 
ing the ’54x generation of DSP devices. Hands-on exercises throughout the 
course give participants a rapid start in developing 54x design skills. Micropro- 
cessor/assembly language experience is required. Experience with digital 
design techniques and C language programming experience is desirable. 


These topics are covered in the ‘54x workshop: 


L] 
L) 
L] 
L} 
L] 
Lj 
Lj 


54x architecture/instruction set 

Use of the PC-based software simulator 
Use of the 54x assembler/linker 

C programming environment 

System architecture considerations 
Memory and I/O interfacing 
Development support 


For registration information, pricing, or to enroll, call (800)336—5236, ext. 3904. 


For assistance to TMS320 questions on device problems, development tools, 
documentation, software upgrades, and new products, you can contact TI. 
See /f You Need Assistance in Preface for information. 
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B.2 Part Order Information 


This section describes the part numbers of 54x devices, development support 
hardware, and software tools. 


B.2.1 Device and Development Support Tool Nomenclature Prefixes 


To designate the stages in the product development cycle, Tl assigns prefixes 
to the part numbers of all TMS320 devices and support tools. Each TMS320 
device has one of three prefix designators: TMX, TMP, or TMS. Each support 
tool has one of two possible prefix designators: TMDX or TMDS. These pre- 
fixes represent evolutionary stages of product development from engineering 
prototypes (TMX/TMDX) through fully qualified production devices and tools 
(TMS/TMDS). This development flow is defined below. 


Device Development Evolutionary Flow: 


TMX The partis an experimental device that is not necessarily representa- 
tive of the final device’s electrical specifications. 

TMP _Thepartis adevice from afinal silicon die that conforms to the device’s 
electrical specifications but has not completed quality and reliability 
verification. 


TMS _ The partis a fully qualified production device. 
Support Tool Development Evolutionary Flow: 


TMDX The development-support product that has not yet completed Texas 
Instruments internal qualification testing. 


TMDS The development-support product is a fully qualified development 
support product. 


TMX and TMP devices and TMDX development support tools are shipped with 
the following disclaimer: 


“Developmental product is intended for internal evaluation purposes.” 


TMS devices and TMDS development support tools have been fully character- 
ized, and the quality and reliability of the device has been fully demonstrated. 
Texas Instruments standard warranty applies to these products. 


cca, | 
Note: 


Itis expected that prototype devices (TMX or TMP) have a greater failure rate 
than standard production devices. Texas Instruments recommends that 
these devices notbe used in any production system, because their expected 
end-use failure rate is still undefined. Only qualified production devices 


should be used. 
| 
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B.2.2 Device Nomenclature 


Tl device nomenclature includes the device family name and a suffix. 
Figure B—1 provides a legend for reading the complete device name for any 
54x device family member. 
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Figure B—1. TMS320 DSP Device Nomenclature 


TMS 320 C 542 PGE (L) 
PREFIX _ TEMPERATURE RANGE (DEFAULT: 0°C TO 70°C) 
TMX= experimental device H = 0°C to 50°C 
TMP=_ prototype device L = O°C to 70°C 
TMS= qualified device S = -55°C to 100°C 
SMJ= MIL-STD-883C M = —55°C to 125°C 
SM = High Rel (non-883C) A = -—40°C to 85°C 
DEVICE FAMILY PACKAGE TYPET 
20 = TMS320 Famil N= plastic DIP 
oad peetiramly J = ceramic DIP 
JD = ceramic DIP side-brazed 
GB = ceramic PGA 
FZ = ceramic CC 
FN = plastic leaded CC 
FD = ceramic leadless CC 
TECHNOLOGY PJ = 100-pin plastic EIAJ QFP 
C = CMOS PZ = 100-pin plastic TQFP 
E = CMOSEPROM PBK = 128-pin plastic TQFP 
F = CMOS Flash EEPROM PQ_ = 132-pin plastic bumpered QFP 
LC= Low-Voltage CMOS (3.3 V) a eee 
VC= Low-Voltage CMOS [3 V (2.5 V GGU= 144-pin BGA 


or 1.8 V core)] 


t DIP = Dual-In-Line Package 
PGA = Pin Grid Array 
CC = Chip Carrier 
QFP = Quad Flat Package 
TQFP = Thin Quad Flat Package 
BGA = Ball Grid Array 


PGF = 176-pin plastic TQFP 
GGW= 176-pin BGA 


DEVICE 
“1x DSP: 
10 16 
14 17 
15 
2x DSP: 
25 
26 
*2xx DSP: 
203 206 240 
204 209 
8x DSP: 
30 
31 
32 
4x DSP: 
40 
44 
5x DSP: 
50 53 
51 56 
52 57 
54x DSP: 
54 545 5402 
542 546 5410 
543 548 5420 
549 
6x DSP: 
6201 6201 
6701 6211 
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B.2.3 Development Support Tools 


Table B—1 lists the development support tools available for the 54x, the plat- 
form on which they run, and their part numbers. 


Table B—1. Development Support Tools Part Numbers 


Development Tool 


Assembler/Linker 
C Compiler/Assembler/Linker 
C Compiler/Assembler/Linker 


C Source Debugger Conversion Software 


C Source Debugger Conversion Software 


Evaluation Module (EVM) 
Simulator (C language) 
Simulator (C language) 
XDS510 Emulatort 
XDS510WS Emulatort 


3 V/5 V PC/SPARC JTAG Emulation Cable 


Platform 
PC (DOS™) 
PC (DOS™, Windows™, OS/2™) 
HP (HP-UX™) / SPARC ™ (Sun OS™) 


PC (DOS™, Windows™, OS/2™) 
(XDS510™) 


HP (HP-UX™) / SPARC™ (Sun OS™) 
(XDS510WS™) 


PC (DOS™, Windows™, OS/2™) 

PC (DOS™, Windows™ ) 

HP (HP-UX™) / SPARC™ (Sun OS™) 
PC (DOS™, Windows™, OS/2™) 


HP (HP-UX™) / SPARC™ (Sun OS™) 
(SCSI) 


XDS510™ / XDS510WS™ 


Part Number 
TMDS324L850-02 


TMDS324L855-02 
TMDS324L555-08 
TMDS32401L0 


TMDS32406L0 


TMDX3260051 
TMDS324L851-02 
TMDS324L551-09 
TMDS00510 
TMDS00510WS 


TMDS3080002 


t Includes XDS510 board and JTAG cable; TMDS32401L0 C-source debugger conversion software not included 
+ Includes XDS510WS box, SCSI cable, power supply, and JTAG cable; TMDS32406L0 C-source debugger conversion software 


not included 


No) of=V alo | ae 


Submitting ROM Codes to TI 


The size of a printed circuit board is a consideration in many DSP applications. 
To make full use of the board space, Texas Instruments offers this ROM code 
option that reduces the chip count and provides a single-chip solution. This op- 
tion allows you to use a code-customized processor for a specific application 
while taking advantage of: 


_} Greater memory expansion 
_j Lower system cost 

_j Less hardware and wiring 
LJ Smaller PCB 


If a routine or algorithm is used often, it can be programmed into the on-chip 
ROM of a TMS320 DSP. TMS320 programs can also be expanded by using 
external memory; this reduces chip count and allows for a more flexible pro- 
gram memory. Multiple functions are easily implemented by a single device, 
thus enhancing system capabilities. 


TMS320 development tools are used to develop, test, refine, and finalize the 
algorithms. The microprocessor/microcomputer (MP/MC) mode is available 
on all ROM-coded TMS320 DSP devices when accesses to either on-chip or 
off-chip memory are required. The microprocessor mode is used to develop, 
test, and refine a system application. In this mode of operation, the TMS320 
acts as a standard microprocessor by using external program memory. When 
the algorithm has been finalized, the code can be submitted to Texas Instru- 
ments for masking into the on-chip program ROM. At that time, the TMS320 
becomes a microcomputer that executes customized programs from the on- 
chip ROM. Should the code need changing or upgrading, the TMS320 can 
once again be used in the microprocessor mode. This shortens the field- 
upgrade time and avoids the possibility of inventory obsolescence. 


Figure C-1 illustrates the procedural flow for developing and ordering 
TMS320 masked parts. When ordering, there is a one-time, nonrefundable 
charge for mask tooling. A minimum production order per year is required for 
any masked-ROM device. ROM codes will be deleted from the TI system one 
year after the final delivery. 
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Figure C—1. TMS320 ROM Code Submittal Flowchart 
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Customer TMS320 Design 


Customer submits: 

— TMS320 New Code Release Form 

— Print Evaluation and Acceptance Form (PEAF) 
— Purchase order for mask prototypes 

— TMS320 code 


Texas Instruments responds: 
— Customer code input into TI system 
— Code sent back to customer for verification 


Customer 


approves 
algorithm 


Customer 
approves 


No 


prototypes (minimum 
production order 
required) 


TMS320 production 


Submitting ROM Codes to TI 


The TMS320 ROM code may be submitted in one of the following forms: 


3-1/2-inch floppy: COFF format from macro-assembler/linker (preferred) 
5-1/4-inch floppy: COFF format from macro-assembler/linker 

Modem (BBS): COFF format from macro-assembler/linker 

EPROM (others): TMS27C64 

PROM: TBP28S166, TBP28S86 


DOU 


When code is submitted to TI for masking, the code is reformatted to accom- 
modate the Tl mask-generation system. System-level verification by the 
customer is therefore necessary to ensure the reformatting remains transpar- 
ent and does not affect the execution of the algorithm. The formatting changes 
involve the removal of address-relocation information (the code address 
begins at the base address of the ROM in the TMS320 device and progresses 
without gaps to the last address of the ROM) and the addition of data in the 
reserved locations of the ROM for device ROM test. Because these changes 
have been made, a checksum comparison is not a valid means of verification. 


With each masked-device order, the customer must sign a disclaimer that 
states: 


The units to be shipped against this order were assembled, for expe- 
diency purposes, on a prototype (that is, nonproduction qualified) 
manufacturing line, the reliability of which is not fully characterized. 
Therefore, the anticipated inherent reliability of these prototype units 
cannot be expressly defined. 


and a release that states: 


Any masked ROM device may be resymbolized as TI standard 
product and resold as though it were an unprogrammed version of 
the device, at the convenience of Texas Instruments. 


The use of the ROM-protect feature does not hold for this release statement. 
Additional risk and charges are involved when the ROM-protect feature is 
selected. Contact the nearest TI Field Sales Office for more information on 
procedures, leadtimes, and cost associated with the ROM-protect feature. 
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Summary of Updates in This Document 


This appendix provides a summary of the updates in this version of the docu- 
ment. Updates within paragraphs appear in a bold typeface. 


Page: Changed or Added: 
Changed the second sentence of the first paragraph to: 


This book, the first volume of a 5-volume set, serves as a reference for the 
TMS320C54x DSP and provides information for developing hardware and 
software applications using the ’54x. 


Added the following two paragraphs: 


Many device references are shown with an apostrophe (’ ) replacing the usual 
alphanumeric prefix (ex. 549 instead of TMS320C549). You should consult 
the device-specific datasheet if you need more information about the device’s 
nomenclature. 


This user’s guide contains limited information about the enhanced peripherals 
available on the ’5402, 5410, and ’5420 devices. For detailed information on 
the enhanced peripherals, see volume 5 of this reference set: TWS320C54x 
DSP, Enhanced Peripherals, literature number SPRU302. 


Added volume 5 of the TMS320C54x DSP Reference Set to Related Documentation 
From Texas Instruments. 


=k 
1 
(ep) 


Changed the first second-level bullet under Memory to: 


mM 192K words x 16-bit addressable memory space (64K-words pro- 
gram, 64K-words data, and 64K-words I/O), with extended program 
memory in the 548, 549, ’5402, °5410, and 5420. 


Added the following information to the configuration table at the bottom of the page: 


Program Program/Data 


Device ROM ROM DARAMt SARAM+ 
5402 4 0 16 0 
5410 16 0 8 56 
5420 0 0 32 168 
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Page: Changed or Added: 


1-7 Changed the first sentence of the note below the option table to: 


t The '541B, 545A, ’546A, 548, 549, 5402, 5410, and 5420 have a software-pro- 
grammable PLL and two additional saturation modes. 


1-8 Added the following information to the ports table at the top of the page: 
Serial Ports 
Multi- 
Host Port Channel _Time-Division 
Device Interface Synchronous Buffered Buffered Multiplexed 
5402 1 0 0 2 0 
5410 1 0 0 3 0 
5420 1 0 0 6 0 
Added the following information to the power supply, speed, and package table. 
Device Power Supply Speed Package 
'vc5402. 3.3V(1.8core) 10ns 144-pin TQFP/144-pin 
Micro Star™ BGA 
'VC5410 3.3V(2.5core) 10ns 144-pin TQFP/176-pin 
Micro Star™ BGA 
'VC5420 3.3V(1.8core) 10ns 144-pin TQFP/144-pin 
Micro Star™ BGA 
2-5 Added the following information to Table 2-2, Program and Data Memory on the 
TMS320C54x Devices: 
Memory Type *5402 5410 5420 
ROM: 4K 16K 0 
Program 4K 16K 0 
Program/data 0 0 0 
DARAMt 16K 8K 32K 
SARAMt 0 56K 168K 
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Summary of Updates in This Document 


Changed or Added: 
Changed the second paragraph in section 2.2.1, On-Chip ROM, to the following: 


On devices with a small amount of ROM (2K words), the ROM contains a boot- 
loader that is useful for booting to faster on-chip or external RAM. For boot- 
loading details on all ’54x devices except the 548, 549, 5402, and 5410, see 
TMS320C54x DSP Reference Set, Volume 4: Applications Guide. For boot- 
loading details on the 548, 549, and 5410, see TMS320C548/549 Bootload- 
er Technical Reference. 


Changed the first sentence of the third paragraph in section 2.2.1, On-Chip ROM, to 
the following: 


On devices with larger amounts of ROM, a portion of the ROM may be mapped 
into both data and program space (except the 5410). 


Changed the first sentence in section 2.7.2, Software-Programmable Wait-State Gen- 
erator, to: 


The software-programmable wait-state generator extends external bus cycles 
up to seven machine cycles (14 machine cycles in the 549, 5402 °5410, and 
”5420) to interface with slower off-chip memory and I/O devices. 


Added the following information to Table 2-3, Host Port Interfaces on the TMS320C54x 
Devices: 


‘HostPort + 5402.~«“‘<‘é&@wAOO#~«755420~C~*# 
Interface 
Standard 8-bit HPI 0 0 0 
Enhanced 8-bit HPI 1 1 0 
Enhanced 16-bit HPI 0 0 1 


Added the following information to Table 2-4, Serial Port Interfaces on the 
TMS320C54x Devices: 


Serial Ports 5402 5410 5420 
Synchronous 0 0 0 
Buffered 0 0 0 
Multichannel Buffered 2 3 6 
TDM 0 0 0 


Changed the first sentence in section 2.9, External Bus Interface, to: 


The ’54x can address up to 64K words of data memory, 64K words of program 
memory (8M words in the ’548, ’549, and 5410; 1M words in the ’5402; 
256K words in the 5420), and up to 64K words of 16-bit parallel I/O ports. 
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Changed or Added: 


Changed the fourth sentence in the first paragraph to: 


In some devices, the memory structure has been modified through overlay and 
paging schemes to allow additional memory space. 


Deleted the following sentence from the first paragraph in section 3.1, Memory Space: 


Together, these three spaces provide a total address range of 192K words 
(except in the 548, ’549, 5402, 5410, and ’5420). 


Changed the fourth paragraph to: 


There are three CPU status register bits that affect memory configuration. The 
effects of these bits are device-specific. 


Replaced Figure 3-4, Memory Maps for the ‘548, with an updated figure. 
Replaced Figure 3-5, Memory Maps for the ‘549, with an updated figure. 
Added Figure 3-6, Extended Program Memory Maps for the 548 and ’549. 
Added Figure 3—7, Memory Maps for the ‘5402. 

Added Figure 3-8, Extended Program Memory for the ‘5402. 

Added Figure 3-9, Memory Maps for the 5410. 


Added Figure 3-10, Extended Program Memory Maps for the ‘5410 (On-chip RAM Not 
Mapped in Program Space and Data Space, OVLY = 0). 


Added Figure 3-11, Extended Program Memory Maps for the ‘5410 (On-chip RAM 
Mapped in Program Space and Data Space, OVLY = 1). 


Added Figure 3-12, Data Memory Map for the ‘5420 Relative to CPU Subsystems A 
and B. 


Added Figure 3-13, Program Memory Maps for the ‘5420 Relative to CPU Subsystems 
A and B. 


Changed the first sentence in section 3.2, Program Memory, to: 


The external program memory on the 54x devices (except on the 548, ’549, 
"5402, 5410, and 5420) addresses up to 64K 16-bit words. 
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Page: Changed or Added: 
3-15 Added the following information to Table 3-1, On-Chip Program Memory Available on 
the TMS320C54x Devices: 
ROM DARAM SARAM 
Device (MP/MC = 0) (OVLY = 1) (OVLY = 1) 
5402 4K 16K a 
5410 16K 8K 56K 
5420 a 32K 168K 
3-17 Added ROM blocks for the 5402 and 5410 in Figure 3-14, On-Chip ROM Block 
Organization. 
3-18 Changed the first paragraph in section 3.2.4, On-Chip ROM Code Contents and Map- 
ping, to: 


The ’54x devices (except 5420) provide a variety of ROM sizes (4K, 16K, 24K, 
28K, or 48K words). On 54x devices with on-chip bootloader ROM, the 2K 
words (at F800h to FFFFh) may contain one or more of the following, depend- 
ing on the specific device: 


3-19 Changed the label above the memory map on the right to: 


'342/543/548/549/5402/5410 


Changed the description at address FFOOh in both memory maps to: 


Reserved 


3-20 Moved the contents of section 3.1.1 to section 3.2.5 and changed the heading to: 


Extended Program Memory (Available on 548/549/5402/5410/5420) 


Changed the first paragraph to: 


The °548, 549, *5402, 5410, and *5420 use a paged extended memory 
scheme in program space to allow access of up to 8192K words of program 
memory. To implement this scheme, the 548, 549, °5402, 5410, and 5420 
include several additional features: 


Changed the first bulleted statement to: 


(J 23 address lines, instead of 16 (20 address lines in the 5402, and 18 in 
the ’5420) 
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Page: Changed or Added: 
3-20 Changed the second paragraph to: 


Program memory in the 548, 549, ’5402, ’5410, and 5420 is organized into 
128 pages (16 pages in the °5402, and 4 in the ’5420) that are each 64K 
words in length, as shown in Figure 3-16. 


3-21 Changed the sentence above the first bulleted list to: 


To facilitate page switching through software, the 548, ’549, 5402, 5410, and 
"5420 have six special instructions that affect the XPC: 


Changed the sentence above the second bulleted list to: 


The following two ’54x instructions are extended in the ’548, 549, 5402, 
5410, and 5420 to use 23 bits (20 bits in the 5402, and 18 in the ’5420): 


3-22 Added the following information to Table 3-2, On-Chip Data Memory Available on the 
TMS320C54x Devices: 
Program/Data ROM 
Device (DROM = 1) DARAM SARAM 
5402 4K 16K = 
5410 16K 8K 56K 
5420 = 32K 168K 
3-23 Changed the second paragraph in section 3.3.2 to: 
The organization of the first 1K of DARAM on all ’54x devices includes the 
memory-mapped CPU and peripheral registers, 32 words of scratch-pad DA- 
RAM, and 896 words of DARAM. 
3-25 Added Figure 3-19, On-Chip RAM Block Organization (‘5402/54 10/5420) 
3-26 Changed the first bullet as follows: 


(1 Theperipheral registers are used as control and data registers in peripher- 
al circuits. These registers reside within addresses 0020h—O05F and 
reside on a dedicated peripheral bus structure. For a list of peripherals on 
a particular ‘54x device, see section 8.2, Peripheral Memory-Mapped 
Registers, on page 8-2. 
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Changed or Added: 


Changed the XPC description at address 1E in Table 3-3, CPU Memory-Mapped Reg- 
isters, to: 


Address Name Description 
i XPC Program counter extension register (548, 549, 


*5402, 5410, and 5420) 
Changed the heading of section 3.3.4.11 to: 


Program Counter Extension Register (XPC, Available on ’548/°549/°5402/ 
5410/5420) 


Changed the note below Figure 4—3 and Table 4—3 to: 
t Refer to the device-specific datasheet to determine if these bits are supported. 
Changed the second sentence in the note at the bottom of the page to: 


However, because the '548, 549, and 5410 have 23 address lines (20 ad- 

dress lines in the 5402, and 18 in the ’5420), the program-memory location 

in these devices is specified by the lower 23 bits of accumulator A. 
Changed the second and third sentences of the first paragraph to: 


The ’54x can address a total of 64K words of program memory using the pro- 
gram address bus (PAB). Table 6-1 shows devices that have additional pro- 
gram memory address lines that provide external access to as many as 128 
64K-word pages. 


Added Table 6-1, Devices With Additional Program Memory Address Lines. 


Changed the sentence below the fifth bulleted item to: 


One additional register is used in the ’548, 549, 5402, 5410, and ’5420 to 
address extended memory: 


Changed the first sentence in the first paragraph to: 


XPC is a 7-bit register that selects the extended page of program memory for 
the 548, 549, 5402, 5410, and 5420. For more information about extended 
program memory in these devices, see section 3.1.1, Extended Program 
Memory, on page 3-10. 

Changed the heading of section 6.3.3 to: 


Far Branches (Available on TMS320C548/549/5402/54 10/5420) 
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Changed or Added: 
Changed the first sentence of section 6.3.3 to: 


To allow branches to extended memory, the '548, 549, 5402, 5410, and 
5420 include two far branch instructions: 


Changed the bulleted list to: 


(4 FB[D] branches to the 23-bit address (20-bit addresses in the 5402 and 
18-bit addresses in the ’5420) specified by the the instruction. 


_) FBACC[D] branches to the 23-bit address (20-bit addresses in the 5402 
and 18-bit addresses in the ’5420) specified in the designated accumu- 
lator. 


Changed the first sentence under the bulleted list to: 


Table 6-6 shows the far branch instructions in the 548, 549, °5402, 5410, and 
5420 (both nondelayed and delayed) and the number of cycles needed to 
execute these instructions. 

Changed the heading of section 6.4.3 to: 


Far Calls (Available on TMS320C548/549/5402/541 0/5420) 


Changed the first sentence in section 6.4.3 to: 


To allow calls to extended memory, the 548, 549, 5402, 5410, and 5420 in- 
clude two far call instructions: 


Changed the bulleted list to: 


(1 The FCALL instruction pushes XPC onto the stack, pushes PC onto the 
stack, and branches to the 23-bit address (20-bit addresses in the 5402 
and 18-bit addresses in the 5420) specified by the the instruction. 


(1 The FCALA pushes XPC onto the stack, pushes PC onto the stack, and 
branches to the 23-bit address (20-bit addresses in the 5402 and 18-bit 
addresses in the 5420) specified in the designated accumulator. 


Changed the first sentence of the paragraph below the bulleted list to: 


Table 6-9 shows the far call instructions in the 548, 549, 5402, 5410, and 
"5420 (nondelayed and delayed) and the number of cycles needed to execute 
these instructions. 


Changed the second paragraph in section 6.5, Returns, to: 


The 548, 549, 5402, 5410, and ’5420 offer an additional return instruction: 
an unconditional far return, both nondelayed and delayed. 
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6-14 Changed the heading of section 6.5.3 to: 
Far Returns (Available on TMS320C548/549/5402/5410/5420) 


Changed the first sentence in section 6.5.3 to: 


To allow returns from extended memory, the 548, ’549, 5402, 5410, and 
5420 include two far return instructions: 


6-15 Changed the first sentence to: 


Table 6—12 shows the far return instructions in the ’548, ’549, *5402, °5410, 
and °5420 (nondelayed and delayed) and the number of cycles needed to 
execute these instructions. 


6-25 Changed the fifth bulleted item to: 
(1 XPC is cleared ('548, 549, 5402, 5410, and 5420) 


6-29 Added the following registers to Figure 6—2, Interrupt Flag Register (IFR) Diagram: 
(h) 5402 IFR 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
BXINT1 | BRINT1 TINT1 

Resvd DMAC5|DMAC4| _ or or | HPINT | |NT3 or |DMACO/BXINTO|BRINTO) TINTO | INT2 | INT1 | INTO 
DMAC3 | DMAC2 DMAC1 


(i) 5410 IFR 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
BXINT1 | BRINT1 BXINT2 | BRINT2 

Resvd DMAC5|DMAC4] _ or or | HPINT | INT3 or or |BXINTO/}BRINTO|; TINT | INT2 | INT1 | INTO 
DMAC3 | DMAC2 DMAC1 | DMACO 


(j) ‘5420 IFR 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
BXINT1 | BRINT1 BXINT2 | BRINT2 

Resvd | IPINT | DMAC5|DMAC4| __ or or HPINT | RSVD or or |BXINTO|BRINTO) TINT | RSVD | INT1 | INTO 
DMAC3 | DMAC2 DMAC1 | DMACO 
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Page: Changed or Added: 


6-30 Added the following registers to Figure 6-3, Interrupt Mask Register (IMR) Diagram: 
(h) 5402 IMR 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
BXINT1 | BRINT1 TINT1 

Resvd DMAC5 | DMAC4] _ or or | HPINT | INT3 or | DMACO}BXINTO|BRINTO} TINTO | INT2 ) INT1 | INTO 
DMAC3 | DMAC2 DMAC1 


(i) 5410 IMR 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
BXINT1 | BRINT1 BXINT2 | BRINT2 

Resvd DMAC5 | DMAC4] _ or or | HPINT | INT3 or or | BXINTO;}BRINTO} TINT | INT2 ) INT1 | INTO 
DMAC3 | DMAC2 DMAC1 | DMACO 


(j) 5420 IMR 
15 14 13 #12 «11 10 9 8 7 6 5 4 3 2 1 0 
BXINT1 | BRINT1 BXINT2 | BRINT2 
Resvd | IPINT |DMAC5|DMAC4| _ or or | HPINT| RSVD | _ or or |BXINTO}BRINTO} TINT | RSVD | INT1 | INTO 
DMAC3| DMAC2 DMAC1 | DMACO 
6-33 Added the following note: 
Note: 


The program counter extension register, XPC, does not get pushed to the top 
of the stack; that is, it does not get saved on the stack. Therefore, if an ISR 
is located on a different page from the vector table, you must push the XPC 
on the stack prior to branching to the ISR. A FRET [E] can be used to return 


from the ISR. 
SS 


6-45 Added Table 6-26, 5402 Interrupt Locations and Priorities. 
6-46 Added Table 6-27, 5410 Interrupt Locations and Priorities. 
6-48 Added Table 6—28, ‘5420 Interrupt Locations and Priorities. 
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Summary of Updates in This Document 


Changed or Added: 
Added the following information to Table 7-1, DARAM Blocks: 


Device Block Sizet Number of Blocks 


5402 8K words 2 
5410 2K words 4 
5420 (each subsystem) 8K words 2 


Added the following second paragraph: 


Enhanced peripherals, available on specific 54x devices, are not discussed 
at length in this chapter. For detailed information on the enhanced peripherals, 
see volume 5 of this reference set: TMS320C54x DSP, Enhanced Peripherals, 
literature number SPRU302. 


Added section 8.1, Available On-chip Peripherals, and inserted the bulleted list of pe- 
ripherals from page 1. 


Changed the fourth bulleted item to: 


(1 ~Host port interface 
M 8-bit standard (’542, ’545, 548, ’549) 
M 8-bit enhanced (’5402, 5410 — see note below) 
m 16-bit enhanced (’5420 — see note below) 


Added the seventh bulleted item: 


(1 Multichannel buffered serial port (McBSP) ('5402, 5410, and 5420 — see 
note below) 


Added the following note below the bulleted list: 


TTT 
Note: Enhanced Peripherals 


For more detailed information on the enhanced peripherals, see volume 5 
of this reference set: TMS320C54x DSP, Enhanced Peripherals, literature 
number SPRU302. 


|) 


Deleted the following sentence from the second paragraph in section 8.2, Peripheral 
Memory-Mapped Registers: 


Note that all accesses to memory-mapped peripheral registers require two 
machine cycles. 
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Changed or Added: 
Deleted the figure titled TMS320C54x Peripheral Memory Mapped Registers. 


Added registers for the 5402, ‘5410, and 5420 devices to Figure 8-1, TMS320C54x 
Peripheral Memory-Mapped Registers. 


Added Table 8-8, 5402 Peripheral-Memory Mapped Registers. 

Added Table 8-9, 5410 Peripheral-Memory Mapped Registers. 

Added Table 8-10, 5420 Peripheral-Memory Mapped Registers. 

Added Table 8-11, ‘5402/5410/5420 McBSP Sub-addressed Registers. 
Added Table 8-12, 5402/5410/'5420 DMA Sub-addressed Registers. 


Deleted the following sentence from section 8.3, General-Purpose I/O: 


For more information about the I/O memory space and accesses using the ex- 
ternal bus, see section 3.4, //O Memory, on page 3-2/, and chapter 10, Exter- 
nal Bus Operation. 


Deleted the figure titled BIO Timing Diagram. 


Deleted the last sentence of the paragraph in section 8.4, Timer, and added the 
following: 


The timer resolution is the CPU clock rate of the processor. The high dynamic 
range of the timer is achieved with a 16-bit counter with a 4-bit prescaler. The 
5402 and the 5420 have two on-chip timers. 

Changed the second sentence of the first paragraph to: 


The timer is driven by aprescaler that is decremented by 1 at every CPU clock 
cycle. 


Changed the last sentence of the second paragraph to: 


The timer is clocked by the CPU clock. 


Changed Figure 8-3, Timer Block Diagram, to show that the timer is clocked by the 
CPU clock. 


Added the following note to the bottom of the page: 


T Note that on the ’5402, the timer1 output (TOUT 1) is only available when the HPI-8 is disabled, 
and the TOUT1 bit is set in the GPIO control register. 
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Changed or Added: 

Changed the sixth and seventh sentences of the first paragraph to: 
PSC is clocked by the device CPU clock. Each CPU clock decrements PSC 
by 1. 

Changed the sentence above the equation to: 


The timer interrupt (TINT) rate is equal to the CPU clock frequency divided by 
two independent factors: 


Changed the sentence below the equation to: 


In the equation, tc) is the period of CPU clock, uis the sum of the TDDR con- 
tents plus 1, and|vjis the sum of the PRD contents plus 1. 


Changed timer initialization step number three to: 
Start the timer by reloading TCR to initialize TDDR. Enable the timer by setting 
TSS to 0 and TRB to 1 to reload the timer period. 

Added the following note to the bottom of the page: 


Tt Note that on the 5402, the timer1 output (TOUT 1) is only available when the HPI-8 is disabled, 
and the TOUT1 bit is set in the GPIO control register. 


Changed the first bulleted item to: 


(1 Acrystal resonator with the internal oscillator circuit. The crystal resonator 
circuit is connected across the X1 and X2/CLKIN pins of the 54x. The 
CLKMD pins must be configured to enable the internal oscillator. 


Changed the last sentence in the paragraph under the bulleted list to: 


The ’541B, ’545A, ’546A, ’548, 549, 5402, 5410, and 5420 devices use a 
software-programmable PLL. 


Replaced all references to “internal source” with “oscillator enabled” in Table 8-15, 
Clock Mode Configurations. 


Changed the heading of section 8.5.2 to: 


Software-Programmable PLL ('541B/545A/546A/ 
548/549/5402/5410/54208) 


Added the following note below the last paragraph: 


§ The VC5420 device does not have CLKMD pins. Following reset, the 5420 operates in bypass 
mode (PLL is off). 
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Changed or Added: 


Changed the clock mode descriptions at 4000h and 7000h in Table 8-16, Clock Mode 
Setting at Reset ('541B/545A/546A/548/'549/'54 10), to: 


CLKMD 
CLKMD1 CLKMD2 CLKMD3 Reset Value Clock Mode 
1 0 0 4000h Divide-by-2, internal oscillator 
enabled 
1 1 1 7000h Divide-by-2, internal oscillator 
enabledt 


Changed the note below Table 8-16 to: 


t Reserved on ’549 and ’5410 
Added Table 8-17, Clock Mode Settings at Reset ('5402). 


Added the following note below Table 8-19, PII Multiplier ratio as a Function of 
PLLNDIV, PLLDIV, and PLLMUL: 


+ This is the default mode for the °5420 after reset. 


Changed the “549 Only” note referencing the graph in Figure 8-7, PLL Lockup Time 
Versus CLKOUT Frequency, to: 


349, and 5410 


Changed the first and second sentences of the third paragraph under Switching Clock 
Mode From PLL Mode to DIV Mode to: 


The switch to DIV mode takes effect in 6 CLKIN cycles plus 3.5 PLL cycles for 
all PLLMUL values except 1111b. For a PLLMUL value of 1111b, the switch to 
DIV mode takes effect in 12 CLKIN cycles plus 3.5 PLL cycles. 


Changed the second sentence of the paragraph under the numbered list to: 


When the PLLCOUNT timer reaches 0, the new PLL mode takes effect after 
6 CLKIN cycles plus 3.5 PLL cycles. 


Changed the first sentence in section 8.6, Host Port Interface, to: 


The standard host port interface (HPI) is available on the 542, ’545, 548, and 
549 devices. 


oO 
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Changed or Added: 
Added the following second paragraph: 


Enhanced host port interfaces are available on the 5402, 5410 (HPI-8) and 
5420 (HPI-16) devices. This chapter does not describe these enhanced HPIs. 
For more information on the HPI-8 and HPI-16, see volume 5 of this reference 
set, 7MS320C54x DSP, Enhanced Peripherals, literature number SPRU302. 


Added the third bulleted item: 
(1 Multichannel buffered serial Port (McBSP) interface 


Added the following information to Table 9—1, Serial Ports on the TMS320C54x 
Devices: 


Standard Multichannel Time-Division 
Synchronous Buffered Buffered Multiplexed 
Device Serial Ports Serial Ports Serial Ports Serial Ports 
"5402 0 0 2 0 
5410 0 0 8 0 
"5420 0 0 6 0 


Added the following reference to Table 9-2, Sections that Cover the Serial Ports: 


Serial Port Mode See... 


MCBSP Multichannel TMS320C54x DSP Enhanced Peripherals 
Volume 5 (literature number SPRU302) 


Changed the first paragraph to: 


Four different types of serial port interfaces are available on 54x devices. The 
basic standard serial port interface is implemented on ’541, ’545, and ’546 de- 
vices. The TDM serial port interface is implemented on the ’542, '543, 548, 
and '549 devices. The '542, 543, ’545, ’546, 548, and 549 devices include 
a buffered serial port (BSP) that implements an automatic buffering feature, 
which greatly reduces CPU overhead required in handling serial data trans- 
fers. The 5402, 5410, and 5420 devices include multichannel buffered 
serial ports (McBSPs). See Table 9-1 for information about the features in- 
cluded in various 54x devices. 
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Page: Changed or Added: 
10-1 Added the following paragraph: 


The 5410 enhanced external parallel interface (XIO) is not described in this 
chapter. See the 5410 datasheet for details about the external memory inter- 


face. 
10-2 Added signal names and descriptions for the ’5402, 5410, and '5420 devices to 
Table 10-1, Key External Interface Signals, as follows: 
548, 549 5402 5420 
Signal Name 65410 Description 
AO-A15 22-0 19-0 17-0 Address bus 
DO-D15 15-0 15-0 15-0 Data bus 
MSTRB in Val Val External memory access 
strobe 
PS in val al Program space select 
DS Val Val Val Data space select 
IOSTRB a a al I/O access strobe 
is a al al I/O space select 
R/W a a a Read/write signal 
READY al a in Data ready to complete cycle 
HOLD a a Hold request 
HOLDA i i Hold acknowledge 
MSC a a Micro state complete 
IAQ al Val Instruction acquisition 
TACK a a Interrupt acknowledge 
10-3 Added the following paragraph: 


When the CPU addresses external data or I/O space, the extended address 
lines are driven to logic 0. This is also the case when the CPU addresses inter- 
nal memory with the AVIS (address visibility) set to 1. 
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Page: Changed or Added: 
10-5 Changed the first sentence in the first paragraph in section 10.3.1, Wait State Genera- 
tor, to: 


The software-programmable wait-state generator can extend external bus 
cycles by up to seven machine cycles (14 machine cycles on ’549, *5402, 
5410, and 5420 devices), providing a convenient means to interface the ’54x 
to slower external devices. 


Added the following sentence to the third paragraph in section 10.3.1, Wait State Gen- 
erator,: 


The SWWSR bit fields of the 548, 549, 5402, 5410, and 5420 are described 
in Table 10-8. 


Changed the note below Figure 10-2, Software Wait-State Register (SWWSR) Dia- 
gram, to: 
t XPA bit on 548, 549, 5402, 5410, and 5420 only 


=e 
xm 
Oo 


Changed the title of Figure 10-3 to Software Wait-State Register (SWWSR) Bit Sum- 
mary, and changed the figure as follows: 


15 1 0 


smn [Remand 


Added the second paragraph below Figure 10-3, Software Wait-State Register 
(SWWSR) Bit Summary, as follows: 


The '549, 5402, ’5410, and 5420 have an extra bit (software wait-state multi- 
plier, SWSM) that resides in SWCR, which is memory mapped to address 
002Bh in data space. 


10-7 Changed the title of Table 10-3 to: 


'548/549/5402/5410/5420 Software Wait-State Register (SWWSR) Bit Sum- 
mary 


10-9 Changed Figure 10-5, Bank-Switching Control Register (BSCR) Diagram, as follows: 


15-12 11 10-9 8 7-3 2 1 0 


[aNKGNE [PS-O5] Resened [PRG Pewee | 
R/W R/W R/W 


R/W R/W R/W 
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Changed or Added: 


Changed the name and function descriptions in Table 10-5, Bank-Switching Control 
Register (BSCR) Bit Summary, as follows: 


Reset 
Name Value Function 
PS-DS - Program read-data read access. Inserts an extra cycle be- 
tween consecutive accesses of program read and data read, 
or data read and program read. 

PS-DS =0 No extra cycles are inserted by this feature ex- 
cept when banks are crossed. 

PS-DS = 1 One extra cycle is inserted between consecu- 
tive accesses of program read and data read, 
or data read and program read. 

Reserved = These bits are reserved. 

IPIRQ = Interprocessor interrupt request bit. 
Reserved - These bits are reserved. 

HBH - HPI bus holder bit. 


Changed the first signal designation in Table 10—7, State of Signals When External In- 
terface is Disabled (EXIO=1), to: 


Signal State 
A(22-0) Previous state 


Changed the third bulleted item to: 


Li Aprogram-memory read followed by another program-memory read from 
a different page (with the 548, ’549, 5402, 5410, and ’5420). 


Changed Figure 10-8, Memory Interface Operation for Read-Read-Write, to show Ad- 
dress as the designation for the second timing event, and Data as the designation for 
the third timing event. 


Changed Figure 10-9, Memory Interface Operation for Write-Write-Read, to show Ad- 
dress as the designation for the second timing event, and Data as the designation for 
the third timing event. 


Changed the last sentence to: 


However, the write, which is already two cycles, is extended to three cycles. 
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Changed or Added: 


Changed Figure 10-10, Memory Interface Operation for Read-Read-Write (Program- 
Space Wait States), to show Address as the designation for the second timing event, 
and Data as the designation for the third timing event. 


Changed Figure 10-11, Parallel |/O Interface Operation for Read-Write-Read, to show 
Address as the designation for the second timing event, and Data as the designation 
for the third timing event. 


Changed Figure 10—12, Parallel I/O Operation for Read-Write-Read (l/O-Space Wait 
States), to show Address as the designation for the second timing event, and Data as 
the designation for the third timing event. 


Changed Figure 10-13, Memory Read and |/O Write, to show Address as the designa- 
tion for the second timing event, and Data as the designation for the third timing event. 


Changed Figure 10-14, Memory Read and |/O Read, to show Address as the designa- 
tion for the second timing event, and Data as the designation for the third timing event. 


Changed Figure 10-15, Memory Write and \/O Write, to show Address as the designa- 
tion for the second timing event, and Data as the designation for the third timing event. 


Changed Figure 10-16, Memory Write and |/O Read, to show Address as the designa- 
tion for the second timing event, and Data as the designation for the third timing event. 


Changed Figure 10-17, I/O Write and Memory Write, to show Address as the designa- 
tion for the second timing event, and Data as the designation for the third timing event. 


Changed Figure 10-18, I/O Write and Memory Read, to show Address as the designa- 
tion for the second timing event, and Data as the designation for the third timing event. 


Changed Figure 10-19, I/O Read and Memory Write, to show Address as the designa- 
tion for the second timing event, and Data as the designation for the third timing event. 


Changed Figure 10-20, I/O Read and Memory Read, to show Address as the designa- 
tion for the second timing event, and Data as the designation for the third timing event. 


Changed Figure 10-21, External Bus Reset Sequence, to show Address as the desig- 
nation for the third timing event, and Data as the designation for the fourth timing event. 


Changed Figure 10-23, HOLD and HOLDA Minimum Timing for HM = 0, to show Ad- 


dress as the designation for the fourth timing event, and Data as the designation for the 
fifth timing event. 


Summary of Updates in This Document D-19 


Summary of Updates in This Document 


Page: 
10-31 


> 
co 


oe 
Sl 


D-20 


Changed or Added: 


Changed Figure 10-24, HOLD and RS Interaction, to show Address as the designation 
for the fifth timing event, and Data as the designation for the sixth timing event in each 
of the four figures. 


Added the following description below the Figure 24 caption for each of the four figures: 
(a) Hold is Asserted While Reset is Active and De-asserted While Reset is Active 
(b) Hold is Asserted While Reset is Active and De-asserted While Reset is Inactive 
(c) Hold is Asserted and De-asserted While Reset is Inactive 


(d) Hold is Asserted While Reset is Inactive and De-asserted While Reset is Active 


Replaced Example B—2, Key Timing for a Single- or Multiple-Processor System With 
Buffered Input and Output, with an updated version. 


Replaced Figure B—1, TMS320 DSP Device Nomenclature, with an updated version. 
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Glossary 


A: See accumulator A. 
ABU: See autobuffering unit. 


ABUC: ABU control register. A register that controls the operation of the 
autobuffering unit. 


accumulator: A register that stores the results of an operation and provides 
an input for subsequent arithmetic logic unit (ALU) operations. 


accumulator A: 40-bit register that stores the result of an operation and 
provides an input for subsequent arithmetic logic unit (ALU) operations. 


accumulator B: 40-bit registers that stores the result of an operation and 
provides an input for subsequent arithmetic logic unit (ALU) operations. 


accumulator shift mode field (ASM): A 5-bit field in status register 1 (ST1) 
that specifies a shift value (from —16 to 15) used to shift an accumulator value 
when executing certain instructions, such as instructions with parallel loads 
and stores. 


adder: A unit that adds or subtracts two numbers. 
address: The location of a word in memory. 


addressbus: A group ofconnections used to route addresses. The 54x has 
four 16-bit address busses: CAB, DAB, EAB, and PAB. 


addressing mode: The method by which an instruction calculates the location 
of an object in memory. 


address visibility mode (AVIS): A bit in processor mode status register 
(PMST) that determines whether or not the internal program address 
appears on the device’s external address bus pins. 


AG: accumulator guard bits. An 8-bit register that contains bits 39-32 (the 
guard bits) of accumulator A. 
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E-2 


AH: accumulator A high word. Bits 31-16 of accumulator A. 
AL: accumulator A low word. Bits15—0 of accumulator A. 


ALU: arithmetic logic unit. The part of the CPU that performs arithmetic 
and logic operations. 


analog-to-digital (A/D) converter: Circuitry that translates an analog 
signal to a digital signal. 


ARO-AR7: auxiliary registers 0-7. Eight 16-bit registers that can be 
accessed by the CPU and modified by the auxiliary register arithmetic 
units (ARAUs) and are used primarily for data memory addressing. 


ARAU: See auxiliary register arithmetic unit. 
ARP: See auxiliary register pointer. 


ARR, ARRO, ARR1: ABU address receive register A 16-bit register that 
specifies the destination address at which the autobuffering unit begins 
storing received data. 


ASM: See accumulator shift mode field. 


autobuffering receiver enable (BRE): A bit in the BSP control extension 
register (BSPCE) that enables/disables the autobuffering receiver. 


autobuffering receiver halt (HALTR): A bit in the BSP control extension 
register (BSPCE) that enables/disables the autobuffer receiver when the 
current half of the buffer is received. 


autobuffering transmitter enable (BXE): Abitinthe BSP control extension 
register (BSPCE) that enables/disables the autobuffering transmitter. 


autobuffering transmitter halt (HALTX): Abitinthe BSP control extension 
extension (BSPCE) that enables/disables the autobuffer transmitter 
when the current half of the buffer has been transmitted. 


autobuffering unit: An extension to the synchronous serial port that reads 
and writes data to the synchronous serial port independent of the CPU. 


auxiliary register arithmetic unit: An unsigned, 16-bit arithmetic logic unit 
(ALU) used to calculate indirect addresses using auxiliary registers. 


auxiliary register file: The area in data memory containing the eight 16-bit 
auxiliary registers. See also auxiliary registers. 


auxiliary register pointer (ARP): A 3-bitfield in status register 0 (STO) used 
as a pointer to the currently-selected auxiliary register, when the device 
is operating in ‘C5x/’C2xx compatibility mode. 


Glossary 


auxiliary registers: Eight 16-bit registers (AR7 — ARO) that are used as 
pointers to an address within data space. These registers are operated on 
by the auxiliary register arithmetic units (ARAUs) and are selected by the 
auxiliary register pointer (ARP). See also auxiliary register arithmetic unit. 


AVIS: See address visibility mode bit. 


AXR, AXRO, AXR1: ABU address transmit register. A 16-bit register that 
specifies the source address from which the autobuffering unit begins 
transmitting data. 


B: See accumulator B. 


bank-switching control register (BSCR): A 16-bit register that defines the 
external memory bank size and enables or disables automatic insertion 
of extra cycles when accesses cross memory bank boundaries. 


barrel shifter: A unit that rotates bits in a word. 


BDRR, BDRRO, BDRR1: BSP daia receive register. Two 16-bit registers 
used to receive data through the buffered serial ports. BDRRO 
corresponds to buffered serial port 0 and BDRR1 corresponds to buff- 
ered serial port 1. 


BDXR, BDXRO, BDXR1: BSP data transmit register. Two 16-bit registers 
used to transmit data through the buffered serial ports. BDXRO 
corresponds to buffered serial port 0 and BDXR1 corresponds to buff- 
ered serial port 1. 


BG: accumulator B guard bits. An 8-bit register that contains bits 39-32 
(the guard bits) of accumulator B. 


BH: accumulator B high word. Bits 31-16 of accumulator B. 


BIO: Ageneral purpose, branch-control, input pin that can be used to moni- 
tor the status of peripheral devices. 


BK: See circular buffer size register. 


BKR, BKRO,BKR1: ABU receive buffer size register. A 16-bit register that 
sets the size of the receive buffer for the autobuffering unit. 


BKX, BKX0, BKX1: ABU transmit buffer size register. A 16-bit register that 
sets the size of the transmit buffer for the autobuffering unit. 


BL: accumulator B low word. Bits 15—0 of accumulator B. 
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block-repeat active flag (BRAF): A bit in status register 1 (ST1) that indi- 
cates whether or not a block repeat is currently active. 


block-repeat counter (BRC): A 16-bit register that specifies the number of 
times a block of code is to be repeated when a block repeat is performed. 


block-repeat end address register (REA): A 16-bit memory-mapped 
register containing the end address of a code segment being repeated. 


block-repeat start address register (RSA): A 16-bit memory-mapped 
register containing the start address of a code segment being repeated. 


BMINT: See buffer misalignment interrupt. 
boot: The process of loading a program into program memory. 


bootloader: A built-in segment of code that transfers code from an external 
source to program memory at power-up. 


BRAF: See block-repeat active flag. 

BRC: See block-repeat counter. 

BRE: See autobuffering receiver enable. 

BRINT, BRINTO, BRINT1: See BSP receive interrupt. 


BRSR: BSP data receive shift register. A 16-bit register that holds serial 
data received from the BDR pin. See also BDAR. 


BSCR: See bank-switching control register. 


BSP: buffered serial port. An enhanced synchronous serial port that 
includes an autobuffering unit (ABU) that reduces CPU overhead in 
performing serial operations. 


BSP receive interrupt (BRINT, BRINTO, BRINT1): A bitin the interrupt flag 
register (IFR) that indicates the BSP data receive shift register (BRSR) 
contents have been copied to the BSP data receive register (BDRR). 
BRINTO corresponds to buffered serial port O and BRINT1 corresponds to 
buffered serial port 1. 


BSP transmit interrupt (BXINT, BXINTO, BXINT1): Abitinthe interrupt flag 
register (IFR) that indicates the the BSP data transmit register (BDXR) 
contents has been copied to the BSP data transmit shift register (BXSR). 
BXINTO corresponds to buffered serial port O and BXINT1 corresponds to 
buffered serial port 1. 


BSPC, BSPCO, BSPC1: Buffered serial port control registers 0 and 1. A 
16-bit register that contains status and control bits for the buffered serial 
port. BSPCO corresponds to buffered serial port 0 and BSPC1 corre- 
sponds to buffered serial port 1. 


Glossary 


BSPCE, BSPCE0, BSPCE1: BSP control extension register. A 16-bit 
register that contains status and control bits for the buffered serial port 
(BSP) interface. The 10 LSBs of the BSPCE are dedicated to serial port 
interface control, whereas the 6 MSBs are used for autobuffering unit 
(ABU) control. 


buffer misalignment interrupt (BMINT): A’549 feature that detects poten- 
tial error conditions and indicates lost words on a serial port interface. 


burstmode: Asynchronous serial port mode in which a single word is trans- 
mitted following a frame synchronization pulse (FSX and FSR). 


butterfly: A kernel function for computing an N-point fast Fourier transform 
(FFT), where N is a power of 2. The combinational pattern of inputs 
resembles butterfly wings. 


BXE: See autobuffering transmitter enable. 


BXSR: BSP data transmit shift register. A 16-bit register that holds serial 
data to be transmitted from the BDX pin. See also BDXR. 


C: See carry bit. 


C16: A bit in status register 1 (ST1) that determines whether the ALU 
operates in dual 16-bit mode or in double-precision mode. 


CAB: Caddress bus. Abus that carries addresses needed for accessing 
data memory. 


carry bit (C): A bit in status register 0 (STO) used by the ALU in extended 
arithmetic operations and accumulator shifts and rotates. The carry bit 
can be tested by conditional instructions. 


CB: C bus. A bus that carries operands that are read from data memory. 


circular buffer size register (BK): A 16-bit register used by the auxiliary 
register arithmetic units (ARAUs) to specify the data-block size in circular 
addressing. 


CLKDV: See internal transmit clock division factor. 
CLKP: See clock polarity. 


CLKOUT off (CLKOFF): A bit in processor mode status register (PMST) 
that enables/disables the CLKOUT output. 
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clock generator: A device consisting of an internal oscillator and a phase- 
locked loop (PLL) circuit driven internally by a crystal resonator with the 
internal oscillator, or externally by a clock source. 


clock mode (MCM): A bit in the serial port control register (SPC), buffered 
serial port control register (BSPC), and TDM serial port control register 
(TSPC) that specifies the source of the clock for CLKX. 


clock polarity (CLKP): Abitin the BSP control extension register (BSPCE) 
that indicates when the data is sampled by the receiver and sent by the 
transmitter. 


CMPT: See compatibility mode. 


code: Asetof instructions written to perform a task; a computer program or 
part of a program. 


cold boot: The process of loading a program into program memory at 
power-up. 


compare, select, and store unit(CSSU): An application-specific hardware 
unit dedicated to add/compare/select operations of the Viterbi operator. 


compatibility mode(CMPT): Abitinstatus register 1 (ST1) that determines 
whether or not the auxiliary register pointer (ARP) is used to select an 
auxiliary register in single indirect addressing mode. 


compiler mode (CPL): A bit in status register 1 (ST1) that determines 
whether the CPU uses the data page pointer or the stack pointer to 
generate data memory addresses in direct addressing mode. 


continuous mode: Asynchronous serial port mode in which only one frame 
synchronization pulse (FSX and FSR) is necessary to transmit several 
packets at maximum frequency. 


CPL: See compiler mode. 


CSSU: See compare, select, and store unit. 


DAB: D address bus. Abus that carries addresses needed for accessing 
data memory. 


DAB address register (DAR): A register that holds the address to be put 
on the DAB to address data memory for reads via the DB. 


DAGEN: See data-address generation logic (DAGEN). 


Glossary 


DAR: See DAB address register. 


DARAM: dual-access RAM. Memory that can be accessed twice in the 
same clock cycle. 


data address bus: A group of connections used to route data memory 
addresses. The ’54x has three 16-bit buses that carry data memory 
addresses: CAB, DAB, and EAB. 


data-address generation logic (DAGEN): Logic circuitry that generates 
the addresses for data memory reads and writes. See also program- 
address generation logic (PAGEN). 


data bus: A group of connections used to route data. The ’54x has three 
16-bit data buses: CB, DB, and EB. 


data memory: A memory region used for storing and manipulating data. 
Addresses 00h—1Fh of data memory contain CPU registers. Addresses 
20h—5Fh of data memory contain peripheral registers. 


data page pointer (DP): A 9-bit field in status register 0 (STO) that specifies 
which of 512, 128 x 16 word pages is currently selected for direct address 
generation. DP provides the nine MSBs of the data-memory address; the 
dma provides the lower seven. See also dma. 


data ROM (DROM): A bit in processor mode status register (PMST) that 
determines whether or not part of the on-chip ROM is mapped into data 
space. 


DB: Dobus. Abus that carries operands that are read from data memory. 


digital loopback mode: A synchronous serial port test mode in which the 
DLB bit connects the receive pins to the transmit pins on the same device 
to test if the port is operating correctly. 


digital loopback mode (DLB) bit: A bit in the serial port control register 
(SPC), buffered serial port control register (BSPC), and TDM serial port 
control register (TSPC) that puts the serial port in digital loopback mode. 


digital-to-analog (D/A) converter: Circuitry that translates a digital signal 
to an analog signal. 


direct data-memory address bus: A 16-bit bus that carries the direct 
address for data memory. 


direct memory address (dma, DMA) :_ The seven LSBs of a direct-ad- 
dressed instruction that are concatenated with the data page pointer 
(DP) to generate the entire data memory address. See also data page 
pointer. 
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dma: See direct memory address. 
DP: See data page pointer. 


DRB: directdata-memory address bus. A 16-bit bus that carries the direct 
address for data memory. 


DROM: See data ROM. 


DRR, DRRO, DRR1: serial port data receive register. Two 16-bit registers 
used to receive data through the synchronous serial ports. DRRO corre- 
sponds to synchronous serial port 0 and DRR1 corresponds to synchro- 
nous serial port 1. 


DSP interrupt (DSPINT): A bit in the HPI control register (HPIC) that en- 
ables/disables an interrupt from a host device to the 54x. 


DXR, DXRO, DXR1: serial port data transmit register. Two 16-bit registers 
used to transmit data through the synchronous serial ports. DXRO corre- 
sponds to synchronous serial port 0 and DXR1 corresponds to synchro- 
nous serial port 1. 


EAB: E address bus. Abus that carries addresses needed for accessing 
data memory. 


EAB address register (EAR): Aregister that holds the address to be put on 
the EAB to address data memory for reads via the EB. 


EB: Ebus. Abus that carries data to be written to memory. 


exponent encoder (EXP): A hardware device that computes the exponent 
value of the accumulator. 


external interrupt: A hardware interrupt triggered by a pin (INTO-INT3). 


fast Fourier transform (FFT): Anefficient method of computing the discrete 
Fourier transform, which transforms functions between the time domain 
and frequency domain. The time-to-frequency domain is called the 
forward transform, and the frequency-to-time domain is called the 
inverse transformation. See also buttertly. 


fast return register (RTN): A 16-bit register used to hold the return address 
for the fast return from interrupt (RETF[D]) instruction. 
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FE: See format extension. 
FIG: See frame ignore. 


format(FO): Abitinthe serial port control register (SPC), buffered serial port 
control register (BSPC), and TDM serial port control register (TSPC) that 
specifies the word length of the serial port transmitter and receiver. 


format extension (FE): Abitinthe BSP control extension register (BSPCE) 
used in conjunction with the format bit (FO) to specify the word length of 
the BSP serial port transmitter and receiver. 


frame ignore (FIG): A bit in the BSP control extension register (BSPCE) 
used only in transmit continuous mode with external frame and in receive 
continuous mode. 


frame synchronization mode (FSM): A bitin the serial port control register 
(SPC), buffered serial port control register (BSPC), and TDM serial port 
control register (TSPC) that specifies whether frame synchronization 
pulses (FSX and FSR) are required for serial port operation. 


frame synchronization polarity (FSP): A bit in the BSP control extension 
register (BSPCE) that determines the status of the frame synchroniza- 
tion (FSX and FSR) pulses. 


fractional mode (FRCT): A bit in status register 1 (ST1) that determines 
whether or not the multiplier output is left-shifted by one bit. 


Free bit: A bit in the serial port control register (SPC), buffered serial port 
control register (BSPC), timer control register (TCR), and TDM serial port 
control register (TSPC) used in conjunction with the Soft bit to determine 
the state of the serial port or timer clock when a breakpoint is encoun- 
tered in the high-level language debugger. See also Soft bit. 


FSM: See frame synchronization mode. 


FSP: See frame synchronization polarity. 


general-purpose input/output pins: Pins that can be used to supply input 
signals from an external device or output signals to an external device. 
These pins are not linked to specific uses; rather, they provide input or 
output signals for a variety of purposes. These pins include the general- 
purpose BIO input pin and XF output pin. 
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HALTR: See autobuffering receiver halt. 
HALTX: See autobuffering transmitter halt. 


hardware interrupt: An interrupt triggered through physical connections 
with on-chip peripherals or external devices. 


HINT: ‘54x-to-Host Processor Interrupt. A bit in the HPI control register 
(HPIC) that enables/disables an interrupt from the ’54x to a host device. 


HM: See hold mode. 


hold mode (HM): A bitin status register ST1 that determines whether the 
CPU enters the hold state in normal mode or concurrent mode. 


host-only mode (HOM): The mode that allows the host to access HPI 
memory while the 54x is in IDLE2 (all internal clocks stopped) or in reset 
mode. 


host port interface (HPI): An 8-bit parallel interface that the CPU uses to 
communicate with a host processor. 


HPl address register (HPIA): A 16-bit register that stores the address of the 
host port interface (HPI) memory block. The HPIA can be preincrem- 
ented or postincremented. 


HPI control register (HPIC): A 16-bit register that contains status and 
control bits for the host port interface (HPI). 


IFR: See interrupt flag register. 
IMR: See interrupt mask register. 


INO: inputO bit. Abitinthe serial port control register (SPC), buffered serial 
port control register (BSPC), and TDM serial port control register (TSPC) 
that allows the CLKR pin to be used as an input. INO reflects the current 
level of the CLKR pin of the device. 


IN1: input 7 bit. Abitinthe serial port control register (SPC), buffered serial 
port control register (BSPC), and TDM serial port control register (TSPC) 
that allows the CLKX pin to be used as an input. IN1 reflects the current 
level of the CLKX pin of the device. 
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internal transmit clock division factor (CLKDV): A 5-bit field in the BSP 
control extension register (BSPCE) that determines the internal transmit 
clock duty cycle. 


interrupt: A condition caused by internal hardware, an event external to the 
CPU, or by a previously executed instruction that forces the current 
program to be suspended and causes the processor to execute an inter- 
rupt service routine corresponding to the interrupt. 


interrupt flag register (IFR): A 16-bit memory-mapped register that flags 
pending interrupts. 


interrupt mask register (IMR): A 16-bit memory-mapped register that 
masks external and internal interrupts. 


interrupt mode (INTM): A bit in status register 1 (ST1) that globally masks 
or enables all interrupts. 


interrupt service routine (ISR): A module of code that is executed in 
response to a hardware or software interrupt. 


IPTR: interruptvector pointer A 9-bit field in the processor mode status reg- 
ister (PMST) that points to the 128-word page where interrupt vectors 
reside. 


IR: instruction register. A 16-bit register used to hold a fetched instruction. 


latency: The delay between when a condition occurs and when the device 
reacts to the condition. Also, in a pipeline, the necessary delay between 
the execution of two instructions to ensure that the values used by the 
second instruction are correct. 


LSB: /east significant bit. The lowest order bit in a word. 


maskable interrupts: A hardware interrupt that can be enabled or disabled 
through software. 


McBSP: See multichannel buffered serial port. 
MCM: See clock mode. 


memory map: A map of the addressable memory space accessed by the 
54x processor partitioned according to functionality (memory, registers, 
etc.). 


Glossary E-11 


Glossary 


E-12 


memory-mapped register (MMR): The ’54x processor registers mapped 
into page 0 of the data memory space. 


microcomputer mode: A mode in which the on-chip ROM is enabled and 
addressable for program accesses. 


microprocessor/microcomputer (MP/MC): A bit in the processor mode 
status register (PMST) that indicates whether the processor is operating 
in microprocessor or microcomputer mode. See also microcomputer 
mode; microprocessor mode. 


microprocessor mode: A mode in which the on-chip ROM is disabled for 
program accesses. 


micro stack: A stack that provides temporary storage for the address of the 
next instruction to be fetched when the program address generation logic 
is used to generate sequential addresses in data space. 


MSB: most significant bit. The highest order bit in a word. 


multichannel buffered serial port (McBSP): High-speed, full duplexed, 
buffered serial ports that allow direct interface to other ’C54x devices, co- 
decs, and other devices in a system. The McBSPs provide full-duplex 
communication, multi-buffered data registers, independent framing and 
clocking for receive and transmit, and a flexible clock generator that can 
be programmed for internal or external shift clocking. 


multiplier: A 17-bit x 17-bit multiplier that generates a 32-bit product. The 
multiplier executes multiple operations in a single cycle and operates 
using either signed or unsigned 2s-complement arithmetic. 


nested interrupt: A higher-priority interrupt that must be serviced before 
completion of the current interrupt service routine (ISR). An executing 
ISR can set the interrupt mask register (IMR) bits to prevent being 
suspended by another interrupt. 


nonmaskable interrupt: An interrupt that can be neither masked by the 
interrupt mask register (IMR) nor disabled by the INTM bit of status 
register 1 (ST1). 


OVA: overflow flag A. A bit in status registerO (STO) that indicates the 
overflow condition of accumulator A. 
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OVB: overflow flag B. A bit in status register 0 (STO) that indicates the 
overflow condition of accumulator B. 


overflow: Acondition in which the result of an arithmetic operation exceeds 
the capacity of the register used to hold that result. 


overflow flag: A flag that indicates whether or not an arithmetic operation 
has exceeded the capacity of the corresponding register. 


OVLY: See RAM overlay. 


OVM: overflow mode bit. Abitin status register 1 (ST1) that specifies how 
the ALU handles an overflow after an operation. 


PAB: See program address bus. 

PAGEN: See program-address generation logic (PAGEN). 

PB: See program data bus. 

PC: See program counter. 

PCM: See pulse coded modulation mode. 

pipeline: A method of executing instructions in an assembly-line fashion. 


pmad: program-memory address. A 16-bit immediate program-memory 
address. 


PMST: processor mode status register. A 16-bit status register that 
controls the memory configuration of the device. See also STO, ST7. 


pop: Action of removing a word from a stack. 


PRD: timer period register. A 16-bit register that defines the period for the 
on-chip timer. 


program address bus (PAB): A 16-bit bus that provides the address for 
program memory reads and writes. 


program-address generation logic (PAGEN): Logic circuitry that gener- 
ates the address for program-memory reads and writes, and the address 
for data memory in instructions that require two data operands. This 
circuitry can generate one address per machine. See also data-address 
generation logic (DAGEN). 
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program address register (PAR): A register that holds the address to be 
put on the PAB to address memory for reads via the PB. 


program controller: Logic circuitry that decodes instructions, manages the 
pipeline, stores status of operations, and decodes conditional operations. 


program counter (PC): A 16-bit register that indicates the location of the 
next instruction to be executed. 


program counter extension register (XPC): A register that contains the 
upper 7 bits of the current program memory address. 


program data bus(PB): Abus thatcarries the instruction code and immedi- 
ate operands from program memory. 


program memory: A memory region used for storing and executing programs. 


pulse coded modulation mode (PCM): A bit in the BSP control extension 
register (BSPCE) that enables/disables the BSP transmitter. 


push: Action of placing a word onto a stack. 


RAM overlay (OVLY): A bit in the processor mode status register (PMST) 
that determines whether or not on-chip RAM is mapped into the program 
space in addition to data space. 


RC: See repeat counter. 
REA: See block-repeat end address. 


receive buffer half received (RH): Abit in the BSP control extension regis- 
ter (BSPCE) that indicates which half of the receive buffer has been 
received. 


receive ready (RRDY): A bit in the serial port control register (SPC), 
buffered serial port control register (BSPC), and TDM serial port control 
register (TSPC) that transitions from 0 to 1 to indicate the data receive 
shift register (RSR) contents have been copied to the data receive regis- 
ter (DRR) and that data can be read. 


receiver reset (RRST): A bit in the serial port control register (SPC), 
buffered serial port control register (BSPC), and TDM serial port control 
register (TSPC) that resets the serial port receiver. 


receive shift register full(RSRFULL): Abitinthe serial portcontrol register 
(SPC) and buffered serial port control register (BSPC) that indicates if the 
serial port receiver has experienced overrun. 
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register: A group of bits used for temporarily holding data or for controlling 
or specifying the status of a device. 


repeat counter (RC): A 16-bit register used to specify the number of times 
a single instruction is executed. 


reset: Ameans of bringing the CPU to a known state by setting the registers 
and control bits to predetermined values and signaling execution to start 
at a specified address. 


RH: See receive buffer half received. 

RINT, RINTO, RINT1: See serial port receive interrupt. 
RRDY: See receive ready. 

RRST: See receiver reset. 

RSA: See block-repeat start address. 


RSR: data receive shift register. A 16-bit register that holds serial data 
received from the DR pin. See also data receive register (DRR). 


RSRFULL: See receive shift register full. 


RTN: See fast return register. 


SARAM: single-access RAM. Memory that can be read written once during 
one clock cycle. 


saturation on multiplication (SMUL): A bit in the processor mode status 
register (PMST) that determines whether saturation of a multiplication 
result occurs before performing the accumulation in a MAC or MAS 
instruction. 


saturation on store (SST): A bit in the processor mode status register 
(PMST) that determines whether saturation of the data from the accumu- 
lator occurs before storing in memory. 


serial port interface: An on-chip full-duplex serial port interface that 
provides direct serial communication to serial devices with a minimum 
of external hardware, such as codecs and serial analog-to-digital (A/D) 
and digital-to-analog (D/A) converters. Status and control of the serial 
port is specified in the serial port control register (SPC). 


serial port receive interrupt (RINT, RINTO, RINT1): A bit in the interrupt 
flag register (IFR) that indicates the data receive shift register (RSR) 
contents have been copied to the data receive register (DRR). RINTO 
corresponds to synchronous serial port 0 and RINT1 corresponds to 
synchronous serial port 1. 
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serial port transmit interrupt (XINT, XINTO, XINT1): A bit in the interrupt 
flag register (IFR) that indicates the the data transmit register (DXR) 
contents has been copied to the data transmit shift register (XSR). XINTO 
corresponds to synchronous serial port 0 and XINT1 corresponds to 
synchronous serial port 1. 


shared-access mode (SAM): The mode that allows both the ’54x and the 
host to access HPI memory. In this mode, asynchronous host accesses 
are synchronized internally and, in case of conflict, the host has access 
priority and the ’54x waits one cycle. 


shared-access mode (SMOD): A bitin the HPI control register (HPIC) that 
enables/disables the shared access mode (SAM). See also shared- 
access mode (SAM) and host-only mode (HOM). 


shifter: A hardware unit that shifts bits in a word to the left or to the right. 


sign-control logic: Circuitry used to extend data bits (signed/unsigned) to 
match the input data format of the multiplier, ALU, and shifter. 


sign extension: An operation that fills the high order bits of a number with 
the sign bit. 


sign-extension mode (SXM): A bit in status register 1 (ST1) that enables 
sign extension in CPU operations. 


SMUL: See saturation on multiplication. 


Soft bit: A bit in the serial port control register (SPC), buffered serial port 
control register (BSPC), timer control register (TCR), and TDM serial port 
control register (TSPC) used in conjunction with the Free bit to determine 
the state of the serial port or timer clock when a breakpoint is encoun- 
tered in the high-level language debugger. See also Free bit. 


software interrupt (SINT): An interrupt caused by the execution of an INTR 
or TRAP instruction. 


software wait-state register (SWWSR): A 16-bit register that selects the 
number of wait states for the program, data, and I/O spaces of off-chip 
memory. 


SP: See stack pointer. 


SPC, SPCO, SPC1: serial port control register. A 16-bit register that 
contains status and control bits for the synchronous serial port. SPCO 
corresponds to synchronous serial port 0 and SPC1 corresponds to synchro- 
nous serial port 1. 
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SST: See saturation on store. 


STO: A 16-bit register that contains 54x status and control bits. See also 
PMST; ST1. 


ST1: A16-bit register that contains ’54x status and control bits. See also 
PMST, STO. 


stack: A block of memory used for storing return addresses for subroutines 
and interrupt service routines and for storing data. 


stack pointer (SP): Aregister that always points to the last element pushed 
onto the stack. 


SXM: See sign-extension mode. 


TADD: TDM address. A single, bidirectional address line that identifies 
which devices on the four-wire serial bus should read in the data on the 
TDM data (TDAT) line. 


TC: tesi/control flag. A bitin status register 0 (STO) that is affected by test 
operations. 


TCLK: TDM clock. A single, bidirectional clock line for TDM operation. 


TCR: timer control register. A 16-bit memory-mapped register that 
contains status and control bits for the on-chip timer. 


TCSR: TDM channel select register. A 16-bit memory-mapped register 
that specifies in which of the eight time slots (channels) a device on the 
four-wire serial bus is to transmit. 


TDAT: TDM data. A single, bidirectional line from which all TDM data is 
carried. 


TDM receive interrupt (TRINT): A bit in the interrupt flag register (IFR) that 
indicates the TDM data receive shift register (TRSR) contents have been 
copied to the TDM data receive register (TRCV). 


TDM transmitinterrupt (TXINT): A bitin the interrupt flag register (IFR) that 
indicates the TDM data transmit register (TDXR) contents have been 
copied to the data transmit shift register (XSR). 


TDXR: TDM data transmit register. A 16-bit register used to transmit data 
through the TDM serial port. See also XSR. 
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temporary register (T): A 16-bit register that holds one of the operands for 
multiply and store instructions, the dynamic shift count for the add and 
subtract instructions, or the dynamic bit position for the bit test instruc- 
tions. 


TIM: timer counter register. A 16-bit memory-mapped register that 
specifies the current count for the on-chip timer. 


time-division multiplexed (TDM): A bit in the TDM serial port control 
register (TSPC) that enables/disables the TDM serial port. 


time-division multiplexing: The process by which a single serial bus is 
shared by up to eight 54x devices with each device taking turns to 
communicate on the bus. There are a total of eight time slots (channels) 
available. During a time slot, a given device may talk to any combination 
of devices on the bus. 


timer divide-down register(TDDR): A4-bitfieldin the timer control register 
(TCR) that specifies the timer divide-down ratio (period) for the on-chip 
timer. 


timer interrupt (TINT): A bitin the interrupt flag register (IFR) that indicates 
the timer counter register (TIM) has decremented past 0. 


timer prescaler counter (PSC): A 4-bit field in the timer control register 
(TCR) that specifies the count for the on-chip timer. 


timer reload (TRB): A bit in the timer control register (TCR) that resets the 
on-chip timer. 


timer stop status (TSS): A bit in the timer control register (TCR) that stops 
and restarts the on-chip timer. 


TINT: See timer interrupt. 


TRAD: TDM receive address register. A 16-bit memory-mapped register 
that contains information about the status of the TADD line in the TDM 
serial port. 


transition register (TRN): A 16-bit register that holds the transition decision 
for the path to new metrics to perform the Viterbi algorithm. 


transmit buffer half transmitted (XH): A bit in the BSP control extension 
register (BSPCE) that indicates which half of transmit buffer transmitted. 


transmit mode (TXM): Abitinthe serial port control register (SPC), buffered 
serial port control register (BSPC), and TDM serial port control register 
(TSPC) that specifies the source of the frame synchronization transmit 
(FSX) pulse. 
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transmit ready (XRDY): A bit in the serial port control register (SPC), 
buffered serial port control register (BSPC), and TDM serial port control 
register (TSPC) that transitions from 0 to 1 to indicate the data transmit 
register (DXR) contents have been copied to the data transmit shift regis- 
ter (XSR) and that data is ready to be loaded with a new data word. 


transmit shift register empty (XSREMPTY): A bit in the serial port control 
register (SPC) and buffered serial port control register (BSPC) that indi- 
cates if the serial port transmitter has experienced underflow. 


transmitter reset (XRST): A bit in the serial port control register (SPC), 
buffered serial port control register (BSPC), and TDM serial port control 
register (TSPC) that resets the serial port transmitter. 


TRCV: TDM data receive register. A register used to receive data through 
the TDM serial port. 


TRINT: See 7DM receive interrupt. 
TRN: See transition register. 


TRSR: TDM data receive shift register. A 16-bit register that holds serial 
data received from the TDM data (TDAT) line. See also TRCV. 


TRTA: TDM receive/transmit address register. The lower half of this 
register specifies the receive address of the device; the upper half of this 
register specifies the transmit address. 


TSPC: TDM serial port control register. A 16-bit memory-mapped register 
that contains status and control bits for the TDM serial port. 


TXINT: See 7DM transmit interrupt. 


TXM: See transmit mode. 


wait state: A period of time that the CPU must wait for external program, 
data, or I/O memory to respond when reading from or writing to that 
external memory. The CPU waits one extra cycle (one CLKOUT 1 cycle) 
for every wait state. 


warm boot: The process by which the processor transfers control to the 
entry address of a previously-loaded program. 
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XF: Ageneral purpose, software-controlled, external flag output pin that al- 
lows for signalling external devices. 


XF status flag: Abitinstatus register ST1 that indicates the status of the XF 
pin. 


XH: See transmit buffer half transmitted. 

XINT, XINTO, XINT1: See serial port transmit interrupt. 
XPC: See program counter extension. 

XRDY: See transmit ready. 

XRST: See transmitter reset. 


XSR: data transmit shift register. A 16-bit register that holds serial data to 
be transmitted from the DX pin (or TDX pin when TDM = 1). See also 
TDXR. 


XSREMPTY: See transmit shift register empty. 


ZA: zero detect A. A signal that indicates when accumulator A contains 
a0. 


ZB: zero detect B. A signal that indicates when accumulator B contains 
a0. 


zero detect: See ZA and ZB. 


zero fill: A method of filling the low- or high-order bits with zeros when load- 
ing a 16-bit number into a 32-bit field. 


*(Ik) addressing 
14-pin connector 

dimensions 
14-pin header 


header signals |A-2 
JTAG re 


A/D converter 
definition 
absolute addressing 
5-4 


ABU. See autobuffering unit 

ABU control register 
definition 

ABU receive address register (ARR) 
definition 

ABU receive buffer size register (BKR) 
definition 

ABU transmit address register (AXR) 
definition 

ABU transmit buffer size register (BKX) 
definition 

accessing DRAM blocks {7-28 


accessing status registers 
latencies 


accumulator 


definition 


accumulator A 
definition 
guard bits 


high word |3-27 
low word 


accumulator A high word (AH) 
definition 
accumulator A low word (AL) 
definition 
accumulator access 
no conflict 
one-cycle latency {7-79 
accumulator addressing 5-1} |5-6 
accumulator B 


definition 
guard bits 
high word 
low word 


accumulator B guard bits (BG) 
definition 


accumulator B high word (BH) 
definition 


accumulator B low word (BL) 
definition 

accumulator guard bits (AG) 
definition 

accumulator shift mode (ASM) [4-5 
definition 

accumulator store 
with shift, example 

accumulators [2-7] to 
application-specific instructions 


FIRS 
LMS 


SQDST }-17 
saturation 


shift and rotate operations 
rotate accumulator left |t-16 


Index 


rotate accumulator left with TC 


rotate accumulator right 
shift arithmetically }-16 
shift conditionally 4-16 
shift logically }-16 
storing contents 


Index-1 


Index 


adder 
definition 
address 


definition 
address modification 
bit-reversed 
circular |5-15 
increment/decrement 
indexed 
offset 
address visibility mode (AVIS) |4-7 
definition 
addresses buses 
addressing mode 
definition 
addressing modifications 
addressing program |3-18}to|3-20 
AG register 
AH register |8-27 
AL register |8-27 
ALU BALI 
block diagram 
carry bit (C) 
input sources |4-11 
X input source [4-11] 
Y input source 
ALU input selection example 
ADD instruction, table |4-12 
analog-to-digital converter 
definition 
application(s) 
automotive _|viiil xiii 
consumer \viiil ki 
development support 
general purpose 
medical 
speech/voice viii] fk 
applications 


TMS320 family 


application-specific instructions [4-17] 


ARO-AR7 registers 


definition 


ARAU. See auxiliary register arithmetic unit 
ARAU and address-generation operation 


ARAUs 
definition 


Index-2 


architectural overview 


architecture -1]to 
block diagram 
bus structure 2-3} 


CPU 
internal memory 
arithmetic logic unit. See ALU 


arithmetic logic unit (ALU) 
carry bit (C) |4-13 
definition 


X input source |4-11 
Y input source 
ARP 
See also auxiliary register pointer 
compatible mode 
definition 
latencies 
ARP load 
three-cycle latency 
two-cycle latenc 
zero latency 
ARR 
definition 
ARx updated with no latency 
example 
ARx updated with one-cycle latency 
example 
ARx updated with two-cycle latency 
example 
ASM 
See also accumulator shift mode field 
definition 


ASM bit field 

latencies |7-70 
ASM field 

shift operations [7-69| 
ASM update 


no latency 
one-cycle latency 

assistance 

autobuffering receiver enable (BRE) 
definition 

autobuffering receiver halt (HALTR) 
definition 

autobuffering transmitter enable (BXE) 
definition 

autobuffering transmitter halt (HALTX) 
definition 


autobuffering unit (ABU 

block diagram 

control register |9-43 

definition 

process 

circular addressing registers {9-47 

automotive applications 
auxiliary register 

updating 
auxiliary register file 

definition 
auxiliary register pointer (ARP) 4-2 

definition 
auxiliary register-auxiliary register conflict 


example 


auxiliary register-memory-mapped register conflict 


example 
auxiliary registers 
ARP indexes 
definition 
AVIS 
See also address visibility mode 
definition 
AXR 
definition 


B. See accumulator B 


bank switching aa 1 0-8]to[10-13} 


adding acycle [10-13] 
BSCR Tai 


control register ee 


field description 
example {10-13 
size |10-11 


bank-switching control register (BSCR) 


bit summary 
BNKCMP bits 


diagram 

EXIO bit 

PS-DS bit 
barrel shifter 

See also shifter 
BDRR 

definition 
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BDXR 
definition 
BG register |3-27 


BH [10-10] 


BH register 


BIO 
definition 
pin 
bit-reversed addressing 
auxiliary register modifications 
step/bit pattern relationship 
BK 
See also circular buffer size register 
BKR 
definition 
BKX 
definition 
BL register 
block diagrams 
4x, internal architecture 
arithmetic logic unit ( ear 
circular addressing 
circular buffer ee 
compare select store unit (CSSU) 
direct addressing 
indirect addressing 
dual data-memory operands 5-21] 
single data-memory operand $-12 
memory-mapped register addressing 
multiplier/adder [4-29] 
shifter 
software wait-state generator |10-8 
timer 
block repeat 
counter register 


end address register 
start address register 


block repeat_operation 
looping 23) 

block-repeat active flag (BRAF) |4-4 
definition 

block-repeat counter (BRC) 
definition 

block-repeat end address (REA) 
definition 

block-repeat start address (RSA) 
definition 

BNKCMP 
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Index 


boot 
definition 
bootloader 


considerations when using 
definition 
BRAF 
definition 
BRAF deactivation 
example |7-75 
branch control input (BIO) pin [8-20] 
branch instructions 
pipeline |7-6 
branch instructions in the pipeline 
figure 
branches 
conditional 
far 
ar ES 
BRC 
See also block-repeat counter 


BRE 9-44) 


definition |E-2 
BRINT 

definition 
BRSR 

definition 
BSCR 


definition 
BSP control extension register (BSPCE) 
bit summary 


BXE bit |9-45||E-2 
CLKDV bits an 


BSP data receive register (BDRR) 
definition 

BSP data receive shift register (BRSR) 
definition 
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BSP data transmit register (BDXR) 
definition 

BSP data transmit shift register (BXSR) 
definition 


BSP operation system considerations 


BSP receive interrupt (BRINT) 
definition 

BSP transmit interrupt (BXINT) 
definition 


BSPC 
definition [E-4][E-5] 
BSPCE 


definition 
buffered serial port (BSP) [2-14| 


autobuffering control register 
autobuffering process 
) 


autobuffering unit (ABU 


buffer misalignment interrupt (BMINT) 


definition 

enhanced features |9-37 
power-down mode _ |9-55 
registers |9-35 

system considerations |9-49 


buffered serial port control register (BSPC) 


definition 
DLB bit 
FO bit 

Free bit 
FSM bit 


E-4)[E-5] 


RRST bit 
RSRFULL bit 


Soft bit 
E-18 


XSREMPTY bit |E-19 


buffered signals 
JTAG 
buffering 


burst mode (serial port) |9-18}/E-5 


bus devices 


bus protocol |A-4' 


bus structure 2-3] 
bus usage |2-4 


bus usage 
table 
butterfly 
definition 


definition 
BXINT 

definition 
BXSR 


definition 


c 
definition 
C address bus (CAB) 
definition 
C bus (CB) 
definition 
C compiler |B-2 
C16 
definition 
C2x/C2xx/C5x compatibility (ARP) mode 
C548 
special instructions |3-21 
cable 
target system to emulator A-1] to [A-25] 
cable pod 
call instructions 
pipeline 
calls 
or ea 


far [6-11] 


iy 
carry bit (C 


can pe 
central processing unit (CPU) 

memory-mapped registers |E-12 

circular addressing 
circular buffer_ [5-17] 
diagram 
rules for using 

circular buffer size register (BK) 8-27}, [8-28 
definition 


CLKDV 
definition 


CLKOFF 
definition 
CLKOUT off (CLKOFF) 
definition 
CLKP 
definition 
clock 


changing the multiplier ratio 


CLKMD 8-29} 


clock mode register (CLKMD) 


considerations when using IDLE instruc- 


operation following reset 


operation in IDLE modes 


sources 


8-35 


8-34 


8-29 


crystal resonator circuit 


external clock -26 
switching clock modes 
DIVto PLL 8-32 
PLL to DIV 8-33 
clock mode (MCM) 


definition 


clock mode register (CLKMD) 


bit summar 
diagram [8-29 


PLLCOUNT bits_|8-30] 


PLLMUL bits 
PLLNDIV bit 


PLLSTATUS bit 
clock modes 


mode configurations |8-27 


settings at reset 
5402 


Index 


'041B/'545A/'546A/548/549/'5410 


sources 
clock polarity (CLKP) 
definition 
CLOCKOUT off (CLKOFF) 
CMPT 
definition 
code 
definition 
code generation tools 
cold boot 


definition 


4-7 


Index-5 


Index 


compare, select, and store unit connector 
(CSSU) |4-26|to 14-pin header |A-2 
See also CSSU dimensions 


definition mechanical 


compatibility mode DuPont 


indirect addressing mode consumer applications xii 
instruction format 6-24 continuous mode (serial port) 
instruction format control applications 


control registers 
external bus |10-5 
counter down-time 


compatibility mode (CMPT) |4-5 
definition 


compiler |B-2 PLL multiplication factors |10-26 
compiler mode CPL 


latencies for SP definition 


SP latencies 
iler mode (CPL) [4-4 cru B-vjtob-9 
vO ee = accumulators |2- [4-15] 
definition ALU 
condition groupings See also ALU 
table arithmetic logic unit 
ces See also ALU 
CApeinoney Oran nee: compare, select, and store unit (CSSU) |2-9) 
delayed [4-26] 
instructions 


components 


CSSU 


exponent encoder 


nondelayed 


conditional calls 


see Itiplier/add 

instruction muluplieradder 

nondelayed shifter 

ss CPU components 
conditional execute CPU registers 
conditional operations |6-16]to|6-19 CSSU_ [2-9 

branch [6-7] diagram |4-26| 

call |6-10 functions 

conditions using CMPS instruction 

execute Viterbi operator |4-27| 


return 
store 
XC instruction Dp) 


conditional returns |6-13) 


with ALU operations |4-27 


delayed pet a 
instruction efinition |E- 


nondelayed D bus (DB) 


definition [E-7| 
conditional store 
conditions for 


DAB address register (DAR) 
definition |E-6 
instructions DAGEN a 


configuration DAGEN register 
multiprocessor |A-13 address conflicts, rules [7-44] 
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DAR. See DAB address register 
DARAM blocks 
table 
data address bus 
definition 
data address generation (DAGEN) 
instructions that access in read stage [7-38] 
data addressing 
five modes 2-9) 
introduction [5-1] 
data bus 
definition 
data buses 
data memory to 
accumulators 
auxiliary registers 
block-repeat registers 
circular buffer size register (BK) |8-28 
configurability 
CPU registers 
definition 
interrupt registers 
on-chip advantages 


processor mode status ster ( (PMS 


program counter extension (XPC) eal 
stack pointer (SP 
status registers 
table pj 
temporary register (T) |3-28 
transition register (TRN) [8-28 
data memory page pointer (DP) |4-3 
definition 
data receive register (DRR) 
data receive shift register (RSR) |9-5 
definition 
data ROM (DROM) 
definition 


data security 


data transmit register (DXR) |9-5 


data transmit shift register (XSR) 
definition 


data types 
16-bit 


32-bit |5-28 
data-address generation logic (DAGEN) 


definition |E-6| 


debug tools |B-2 


debugger. See emulation 
delayed branch instruction in the pipeline 
development support applications 


development tools 
device nomenclature 
diagram 
prefixes |B-5 
diagnostic applications 
digital loopback mode 
definition 
digital loopback mode (DLB) bit 
definition 
dimensions 
12-pin header |A-20 
14-pin header |A-14 
mechanical, 14-pin header |A-14 


direct eo b-9| 5-1] 5-7| 


diagram 


DP- on 5-9} 
instruction format 


instruction word fields 


opcode p-8 
SP-referenced 


direct data-memory address bus 
definition 

direct data-memory address bus (DRB) 
definition 

direct memory address 
definition 

direct-addressing mode 
DP 

DLB 
definition 


dma |E-8 

DMA sub-addressed registers 
5420 

dmad addressing 

DP 


definition 

direct-addressing mode |7-63 

latencies {7-64 
DP load 

three-cycle latency 

two-cycle latenc 

zero latency 


Index 
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Index 


DP-referenced direct addressing 
diagram 
DRAM blocks 
access |7-28 
DROM 
definition 
DROM setup 
followed by a dual-read access [7-78] 
followed by aread access {7-78 
DRR 


definition 
DSP 
articles kh 


DSP interrupt (DSPINT) 
definition 


DSPINT 
definition 
DSPINT and HINT function operation [8-51] 
dual 16-bit mode [4-14 
dual 16-bit/double-precision arithmetic mode 
(C16) 
dual access memory 


dual data-memory operand addressing 


auxiliary registers 
diagram 
indirect addressing mode 
diagram 
instruction format 6-20 
instruction format 
types of 
using Xmem 
using Ymem 


dual operands 
circular |5-22 
increment/decrement 


indexed 
single-operand instructions {5-22 
dual-access RAM (DARAM) {2-6 
definition 
DuPont connector |A-2 
DXR 


definition 


E address bus (EAB) 
definition 
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E bus (EB) 
definition 


EAB address register (EAR) 
definition 


EMUO0/1 
configuration 
emulation pins |A-20 


IN signals 

rising edge modification 
EMUO/1 signals A-6 
emulation 

JTAG cable 

timing calculations [A-7|to[A-9|[A-18]to 
emulator 

connection to target system 

JTAG mechanical dimensions to 

designing the JTAG cable 

emulation pins 

pod interface |A-5| 


signal buffering [A-10|toA-13| 


target cable 


header design f-2)to[A-3 


emulator pod 


timings |A-6 
enabling the timer 
execute 


conditional 

execute interrupt service routine (ISR) 
interrupt context save |6-34| 
interrupt latency 

EXIO 

EXP encoder 
definition 

exponent encoder |4-29 


definition 
figure 

extended program memory 
paged |3-21 


external bus 
hold mode 
IDLE3 wake-up sequence 
interface 
interrupts 
prioritization 
reset |10-29 
timing 
V/O access 
memory access 
reset 


external bus control registers |10-5 
external bus interface [2-16] 
external bus operation 

introduction [10-1] 
external flag output (XF) pin 
external interface 

key signals, table |10-2 


far branches |6-8 
instructions [6-8 
unconditional |6-8 
far calls 
instructions 
unconditional |6-11 
far returns 
instructions 
unconditional 
fast Fourier transform (FFT) |E-8 
fast return register (RTN) 
definition 
FE 
definition 
FIG 
definition 
FO 
definition 
format (FO) 


definition 


format extension (FE) 
definition 

fractional mode (FRCT) |4-5) 
definition 


frame ignore (FIG 
definition 

frame synchronization mode (FSM) 
definition 

frame synchronization polarity (FSP) 
definition 

FRCT 
definition 

Free bit 9-9) 
definition 

FSM 11] B-13 
definition 
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FSP 


definition |E-9 


general-purpose applications 
generator 


clock 
wait-state 


graphics/imagery applications k] 


half-cycle accesses 
instruction performing dual-operand write [7-28] 
instruction performing operand read/write 
instruction performing single-operand read 
instruction performing single-operand write 
instruction word prefetch F728] 


half-cycle accesses to dual-access memory 


HALTR 
definition 

HALTX 
definition 


hardware 
block diagram 
timer [2-12 
Harvard architecture |1-5 
header 
14-pin 
dimensions, 14-pin |A-2 
HINT 
definition [E-10| 
HM 
definition [E-10] 
HOLD and HOLDA minimum timing 
hold mode 
hold mode (HM 
definition 
HOM 
host-only mode 
host port interface 
block diagram 
control register bit descriptions 
definition 
details of operation 
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Index 


host port interface (continued 
functional description 
generic block diagram 
host read/write access _|8-46 
input control signals 
memory access during reset |8-54 
memory access mode, SAM/HOM 
operation during reset 
register description 
signal names and functions |8-41 
timing diagram 

host processor interrupt (HINT) 
definition 

host-only mode (HOM) 

host-port interfaces 
table |2-12 

HPI. See host port interface 

HPI address register (HPIA) |E-10 

HPI control register (HPIC) 
’B4x reads from HPIC 
54x writes to HPIC 
DSPINT bit 
HINT bit |E-10 


host reads from HPIG 
host writes to HPIC |8-46 
SMOD bit 

HPI modes 


host only (HOM) _ |E-10 
shared access (SAM) |E-16 


1/O 


access timing 
pins_[p-20] 
BIO pin __ 
branch control input (BIO) pin _8-20 
external flag output (XF) pin 8-20 
XF pin B-2d 
ports 
parallel _P-16 
serial 
1/0 memory 
I/O pins 
BIO 
XF 
IDLE1 mode 
IDLE2 mode {6-51 
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IDLE3 mode {6-51 
IDLE3 wake-up sequence 
IEEE 1149.1 specification 
bus slave device rules 
IEEE standard 1149.1 2-16 
IFR 
definition 
immediate addressing [2-9}|5-1}|5-2 
instructions, table 5-2} 
long 6-2} 
short 
IMR 
definition 
INO 
definition 
definition 
indirect addressing [2-9} 5-1] 
address roifeaters 5-13)|5 
address- a eperation | 
ARAU_ {5-11} 
assembler syntax [5-23] 
diagram ESI 
dual-operand eae [5-19] 
instruction format 
compatibility mode 6-24 
dual data-memory operands 5-20 
single data-memory operand 6-10 
instruction word fields 


D-20 
Ymod 5-20 
single-operand addressing [5-10][5-13] 
initialization 
timer 
input 0 (INO) 
definition [E-10] 
input 1 (IN1) 
definition [E-10] 
input sources 
ALU /|4-11 
multiplier |4-22 
instruction fetch and operand read 


figure 


instruction register (IR) 
definition 


instructions 
multiconditional |6-17 


internal memory 
on-chip 
dual-access RAM (DARAM) -4 
ROM 
security 
single-access RAM (SARAM) -4 
organization 
internal transmit clock division factor (CLKDV) 
definition [E-11] 
interrupt 
definition [E-11] 
interrupt flag register (IFR) |3-26}|6-27 
BRINT bit 


BXINT bit 
definition |E-11) 
diagram 
RINT bit 
TINT bit 


TRINT bit 


diagram 


interrupt mode (INTM) 4-4 
definition 

interrupt operation [6-35] 
diagram 

interrupt phases 


interrupt service routine 
definition 


interrupt mask register (IMR) |38-26}|/6-29 
definition 


interrupt tables |6-38 


interrupt vector address generation 
diagram 


interrupt vector pointer (IPTR) |4-6 
definition 

interrupts |6-26|]10-29) 
hardware 
interrupt flag register (IFR) 
interrupt mask register (IMR) 
latency time 


NMI 
nonmaskable 
reset 

RS interrupt 
saving data |6-34 
soft reset 
user-maskable (external) |E-8 


interrupts phases 


acknowledge interrupt 
execute interrupt service routine |6-33 
receive interrupt request 
INTM 
definition [E-11] 
introduction |1-1]to|1-8 
features |1-6 
TMS320 family overview |1-2 
TMS320C54x overview |1-5 
IPIRQ 
interprocessor interrupt request bit 
IPTR 
definition [E-11] 
latencies, table |7-76 
IPTR setup 
followed by a software trap 
IR 


definition [E-11] 


JTAG 

JTAG emulator 
buffered signals 
connection to target system |A-1|to|A-25 
no signal buffering ESI 


Index 
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Index 


latencies 
accessing ARx 
accessing BK 


auxiliary register 
DROM bit, table 
store instructions {7-39 
latencies for SP 
compiler mode {7-51 
non-compiler mode (CPL =0) {7-55 
latency 
definition 
explanation of 
least significant bit (LSB) 
definition 
logic/arithmetic operations 
multiconditional instructions 


long-immediate addressing 
RPT instruction 


MAC and MAS saturation 


maskable a 


ee 


MCM oe 


settee =a 
medical applications 
memory 


data memory 

data security |3-30| 

extended program a [3-20] 
I/O access timing 
introduction eg, Hol 

memory access pig 
memory space |3-2|to|3- 
program memory ecm 


extended program memory B-2q 
word order 5-29] 

memory maps 
BAI 
542 
543 
B45 
B46 
548 
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memory maps_ (continued) 


5402 
5410 
5420 
definition |E-11| 


extended program 
548 and’549 B-g 
‘5402 3-10 
5410 8-12 
‘5420 -14] 
memory security 
memory space B-2]to 8-14] 
memory-mapped registers 
conflict example 
instructions for accessing [7-35] 


memory-mapped register addressing 
5-251 
diagram 


instructions 


LDM 6-26 
MVDM_ 5-26 
MVMD_ $-26 


2) 
D 


POPM 6 
PSHM 6 
STLM 
STM 6-26 
memory- mapped registers 2-6] [3-25] 
defined 
Seer ES 
micro stack 


definition 
microcomputer mode 

definition 
microprocessor mode 

definition 
microprocessor/microcomputer (MP/MC) 

definition 
military applications 
mode selection 

clock modes |8-27 
most significant bit (MSB) 

definition [E-12] 
MP/MC 

definition [E-12] 

latencies, table |7-76 
MP/MC setup 


followed by an unconditional delayed call [7-77] 


multi-channel buffered serial port (McBSP) 
definition 
multi-channel buffered serial ports [2-15] 
multi-cycle instructions 
transformed to single-cycle 
multimedia applications 
multiplier 
definition 


multiplier/adder 4-21} to 4-24] 


block diagram 

input sources 
multiplier input selection, table 
multiply/accumulate (MAC) instructions 
multiply/subtract (MAS) instructions 
square/add (SQRA) instructions 


square/subtract (SQRS) instructions 


nested interrupt |E-12 


nomenclature_|B-6} 
prefixes 
nonmaskable interrupt 


normalization of accumulator A 
example 


on-chip 
dual-access RAM (DARAM) 
peripherals 
ROM 
security 
single-access RAM (SARAM) 


on-chip DARAM 

on-chip data memor 
available, table 

on-chip memory 


advantages 
available, table 


on-chip peripherals 


buffered serial port (BSP) |9-33 
serial port interface |9-4| 
TDM serial port 


on-chip RAM 


organization 


on-chip ROM 
figure |3-17 
organization 
program memory map 
figure 

on-chip ROM contents 

operand write and operand read conflict 
figure 

output modes 
external count_|A-20 
signal event 


OVA 

definition [E-12] 
OVB 

definition [E-13| 
overflow 

definition [E-13] 
overflow flag 


definition [E-13] 
overflow flag A (OVA) /4-3 

definition [E-12] 
overflow flag B(OVB) /4-3 

definition 
overflow handling 


overflow mode (OVM) 
definition 


overview 


architecture 
TMS320 family 
TMS320C54x 
OVLY 
definition [E-14] 
latencies, table |7-76 
OVLY setup 
followed by a conditional branch 
followed by a return 
followed by an unconditional branch 
OVM 


definition [E-13] 


PA addressing 


PAB 
definition [E-13] 
PAGEN 


diagram 


PAL 
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Index 


parallel I/O ports |2-16 
part numbers 
tools 
part-order information |B-5 
PB 
definition [E-14] 
PC 
definition [E-14] 
PCM 
definition 
peripheral control [8-2|to|8-19] 
peripheral memory-mapped registers 
541/’541B (8-3 
542 
B43 
'545/’545A_ (8-6 
*546/546A_ |8-7 
548 a 


peripherals [8-1]to|8-26] 
bank switching 
buffered serial port (BSP) [9-33] 
clock generator 
clock modes 
control 
general-purpose I/O pins 
hardware timer 
host port interface 

) 


host port interface (HPI) |8-37, 


parallel I/O ports |2-16 


programmable bank switching 
serial I/O ports 


pipeline (continued 


call instruction 


call instructions {7-8 
CC instruction 


CCD instruction 

conditional call/oranch instructions _|7-20 

conditional-execute instructions {7-19 

definition 

delayed call instruction 

delayed return instruction 

delayed return with interrupt enable 
instruction 

delayed return-fast instruction [7-18] 

instructions 

interrupt response 

INTR instructions 

introduction 


latency 


ingeneral V-35 
precautions [7-35 
store instructions 
types of 

levels 

operation 

return instruction 

return instructions 


return with interrupt enable instruction [7-15] 


return-fast instruction 
six-level structure 


XC instruction 


pipeline latencies {7-35 
pipeline levels/functions 


access 
decode 


execute/write {7-2 
program fetch 
program pre-fetch 
read 


serial port interface [9-4 pipeline operation [2-10] 
software-programmable wait-state pipeline stages 
generator |10-5 figure 


TDM serial port |9-56 
timer ia2i] 


wait-state generator 


pipelined memory accesses 
instruction performing dual-operand read 
instruction performing dual-operand write {7-4 


pins instruction performing operand read and 
V0 wite [4 
pipeline instruction performing single-operand read 


BC instruction 


instruction performing single-operand write |7-4 
BCD instruction |7-24 ral 


instruction word fetch 
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pipeline-protected instruction 


update T register |7-57 
write to ASM 


PLL 
changing the multiplier ratio [8-34] 
considerations when using IDLE 


instruction 

hardware-configurable 
operation following reset 
operation in IDLE modes 
programmable lock timer |8-31 
programming considerations 
software programmable 
switching clock modes 


DIVto PLL 8-32 
PLL to DIV 8-33 


PLL lockup time versus CLKOUT frequency 
pmad 

definition [E-13] 
pmad addressing 


PMST 
See also processor mode status register (PMST) 
definition 
latencies {7-75 


polarity bit 


clock 


frame sync {9-38 
pop 


definition |E-13] 
power-down mode (6-50|to[6-52] 
disabling external interface internal clock 
Hold mode 
IDLE 1 instruction 
IDLE 2 instruction 
IDLE 3 instruction 
initiated using HOLD signal 
invoking i-50] 
other power-down capabilities |6-52 


power-down modes 


operation during 


PRD 
definition |E-13] 
prioritization 


external bus |10-4 


Index 


processor mode status register (PMST) |38-27 


AVIS bit 
bit summary 
CLKOFF bit 


definiti 


diagram 
DROM bit 


IPTR f 


OVLY 
SMUL 


on 


ield 


bit 
bit 


program address bus (PAB) 


definiti 


on [E-i3| 


program address register (PAR) 


definiti 


on |E-14] 


program addressing 
introduction 


program bus 


program control 
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